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Abstract 

In this paper, the notion of strongly typed language will be borrowed from the field of computer 
programming to introduce a calculational framework for linear algebra and tensor calculus for the purpose 
of detecting errors resulting from inherent misuse of objects and for finding natural formulations of various 
objects. A tensor bundle formalism, crucially relying on the notion of pullback bundle, will be used to 
create a rich type system with which to distinguish objects. The type system and relevant notation 
is designed to "telescope" to accomodate a level of detail appropriate to a set of calculations. Various 
techniques using this formalism will be developed and demonstrated with the goal of providing a relatively 
complete and uniform method of coordinate-free computation. 

The calculus of variations pertaining to maps between Riemannian manifolds will be formulated using 
the strongly typed tensor formalism and associated techniques. Energy functionals defined in terms of 
first order Lagrangians are the focus of the second half of this paper, in which the first variation, the 
Euler-Lagrange equations, and the second variation of such functionals will be derived. 

Introduction 

Many important differential equations have a variational origin, being derived as the Euler-Lagrange equa- 
tions for a particular functional on some space of functions. The variational approach lends itself particularly 
to physics, in which conservation of energy or minimization of action is a central concept. The naturality of 
such formulations can't be understated, as solutions to such problems often depend critically on the inherent 
geometry of the underlying objects. For example, solutions to Laplace's equation for a real valued function 
(e.g. modeling steady-state heat flow) on a Riemannian manifold depend qualitatively on the topology of the 
manifold (e.g. harmonic functions on a closed Riemannian manifold are necessarily constant, which makes 
sense geometrically because there is no boundary through which heat can escape). 

A central concept in the field of software design is that of information hiding [ ], in which a com- 
puter program is organized into modules, each presenting an abstract public interface. Other parts of the 
program can interact only through the presented interface, and the details of how each module works are 
hidden, thereby preventing interference in the implementation details which are not required by the inherent 
structure of the module. This concept has clear usefulness in the field of mathematics as well. For example, 
there are several formulations of the real numbers (e.g. equivalence classes of Cauchy sequences of rational 
numbers, Dedekind cuts, decimal expansions, etc), but their particulars are instances of what are known as 
implementation details, and the details of each particular implementation are irrelevant in most areas of 
mathematics, which only use the inherent properties of the real numbers as a complete, totally ordered field. 
Of course, at certain levels, it is useful or necessary to "open up the box" [go past the public interface] and 
work with a particular representation of the real numbers. 

Information hiding is characteristic of abstract mathematics, in which general results are proved about 
abstract mathematical objects without using any particular implementation of said objects. These results 
can then be used modularly in other proofs, just as the functionality of a computer program is organized into 
modularized objects and functions. For example, a fixed point theorem for contractive mappings on closed 



sets in Banach spaces, but a particular application of this theorem renders an existence and uniqueness 
theorem for first order ODEs [ U\ pgs. 59, 62]. 

A loose conceptual analogy for modularity is that of diagonalizing a linear operator. A basis of eigen- 
vectors are chosen so that the action of the linear operator on each eigenspace has a particularly simple 
expression, and distinct eigenspaces do not interact with respect to the operator's action. In this analogy, 
the eigenvectors then correspond to individual lemmas, and the linear operator corresponds to a large theo- 
rem which uses each lemma. Decomposing the proof of the main result in terms of non-interacting lemmas 
simplifies the proof considerably, just as it simplifies the quantification of the linear operator. The term 
"orthogonal" has been borrowed by software design to describe two program modules whose functionality is 
independent [IS, Chapter 4, Section 2]. Orthogonality in software design is highly desirable as it generally 
eases program implementation and program correctness verification, as the human designers are only capable 
of keeping track of a certain finite number of details simultaneously [13]. The scope of each detail level of 
the design is limited in complexity, making the overall design easier to comprehend. 

This technique in software design carries over directly to proof design, where it is desirable (elegant) to 
write proofs and do calculations without introducing extraneous details, such as choice of bases in vector 
spaces or local coordinates in manifolds. Because such choices are generally non-unique, they can often 
obscure the inherent structure of the relevant objects by introducing artifacts arising from properties of the 
particular details used to implement said objects. For example, the choice of a particular local coordinate 
chart on a manifold artificially imposes an additive structure on a neighborhood of the manifold, but such 
a structure has nothing to do with the inherent geometry of the manifold. Furthermore, the descent to this 
"lower level" of calculation discards some type information, representing points in a manifold as Euclidean 
vectors, thereby losing the ability to distinguish points from different manifolds, or even different localities 
in the same manifold. 

This paper makes a particular emphasis on natural formulations and calculations in order to expose 
the underlying geometric structures rather than relying on coordinate-based expressions. The construction 
of the "full" direct sum and "full" tensor product bundles are used in combination with induced covariant 
derivatives to this end. 

Notation and Conventions 

Let all vector spaces, manifolds and [fiber] bundles be real and finite-dimensional unless otherwise noted (this 
allows the canonical identification V** = V for a, vector space or vector bundle V), and let all tensor products 
be over R. The unqualified term "bundle" will mean "fiber bundle". The Einstein summation convention will 
be assumed in situations when indexed tensors are used for computation. 

Unary operators are understood to have higher binding precedence than binary operators, and super and 
subscripts are understood to have the highest binding precedence. For example, the expression VX m o (p 
would be parenthesized as (V (X_m)) o 

Apart from the obvious purpose of providing a concise and central reference for the notation in this 
paper, the following notation index serves to illustrate the use of telescoping notation (see Section 2). The 
high-level (terse notation which requires the reader to do more work in type inference but is more agile), 
mid-level, and low-level (completely type-specified, requiring little work on the part of the reader) notations 
are presented side- by-side with their definitions. 

Let / C R be a neighborhood of 0, let e,i each be coordinates on /, let A,Ai, . . . ,An,B be sets, let 
M,N be manifolds, let cj) e C°° {M,N), let 7rf^ : A M and tt^ : H ^ N and be vector bundles, where 
E,F,Fi,...,Fn,G,let U,V,Vi,...,Vn,W he vector spaces, and let Cj e F {F^ (E)m T*M) such that 

e F {{Fi ®M • • ■ ©M Fn) <E)M T*M) 

is a vector bundle isomorphism. 
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High- 


Mid- 


Low-level 


Description 


Variations; variational derivatives; tangent vectors. 




me 


m 


Variation of a point in M; I B t ^-^ £ M; m: I M. 




6 


Se 


Variational derivative; := <=o. 






5 em 


Tangent vector; linearization of a variation; 
Srue € TmoM; S^m € r„(o)M. 


Projection maps; canonical isomorphisms; bundle-related maps and spaces. 


pr 




prfix •x^" 
p,A,x...xA„ 


Set-theoretic projection onto zth factor or named factor; 

pjAiX-.-xA„ . X • • • X A„ ^ ^i. 


L 


Lb, t 




Canonical isomorphism; : A ^ B; ■= (is) 


TT 


F 

■Km, tt 




Bundle projection map; ttm : F — >• M. 


P 






Fullback bundle fiber projection map; ^ : (f)*!! — > H. 


Trivial bundle constructions and projection maps. 


MxN^N 


M^N ^ N 


Trivial bundle over N; M ^ N := M x N; 








Tv^^^ : M^N ^ N, {m,n)^^ n. 


M X N ^ M 


M^N ^ M 


Trivial bundle over M; M ^ N := M x N; 








TT^*^^: M¥rN ^M, (m,n) H^-rn. 


Shared base-space bundle constructions and projection maps. 


Ex F^ M 


Exm F ^ M 


Direct product; E Xm F := UmsM Em x Em--, 
"^(e,/) ■■=^M {e) = n'M (/)■ 


Ei 


e F ^- M 


E®mF 


Whitney sum; E(BmF := U™6m -Em F^; 
^M®"^{e®f) :=^^ (e) = ^r,(/). 


Ei 




E®M F ^ M 


Tensor product; E ®m F := UmeM Em® Fm; 

^E^mF (^^ij^, ^ f,^ (g^) ^ ^F (fo^ ^j^y 


Separate base-space bundle constructions and projection maps. 


E X H ^ M X N 


E Xmxn H ^ M X N 


Direct product; E Xmxn H := ]J(^ „)eMxJV Em x i7„. 








'^Mx"'""(e,/i) := {^fi (e)',7rf (h)). 


E®H^MxN 


E®MxN H ^ M X N 


Whitney sum; E (Smxn H ■= \l(m,,i)eMxN Em ® Hn- 








T^MxN (e(Bh} := (ttm (e) , n,^ (h)) . 


E®H ^ M xN 


E®MxN H ^ M X N 


Tensor product; E ®mxn H ;= LJ(m,n)eMxiv Em ® H„. 








^mIV" (c'^ei ® hi) ■■= i^M {ek),7^lr (he)) (for any kj). 


Trace; natural pairing; tensor/tensor field contraction. Simple tensor expressions are extended linearly. 




tr 


tTV 


Trace on F; tr^ : V*(3V^R, a®vi-^a {v). 




a • V 


a -v V 


Natural pairing; -y-- V* xV ^R, {a, v) i-^ a (v). 




A-B 


A-v B 


Tensor contraction; -v- {11® V*) x {V ®W) ^ U ®W, 
{u ® a) -v {v ® w) := u ® {a -v v) ® w = a {v) u® w. 




S-"T 




Alternate for -v, where V = Vi ® ■ ■ ■ ®Vn- 


S: 


T,S-'^T 




Special notation for n — 2. 




tr 




Trace on F ^ M; trF : T {F* ®m F) ^ (M, K) , 
[trF (cr ®M /)] {m) := a (m) -f^ f (m) for m € M. 




T-f 


cr-F f 


Natural pairing; -f : T (F*) x T (F) ^ C°° (M,R), 
{a -F f) (m) := a (m) -f^ f l"^) ^ 




A-f 


A-Ff 


Natural pairing; -f : F (F ®m F*) x F ^ E, 

(e ®M cr) -F / := e (rn) (a (m) -f^ /) G Em', m := ttm (,/"). 




S-T 


S-fT 


Tensor field contraction; pointwise tensor contraction; 
•f: r(F(8)M F*) xV{F®M G)^V{E®mG), 

[(e ® ct) -f (/ ® 5)] := ("^ ("i) -F™ / (m)) e (m) ® ,g (m). 




S-"T 


S ■Fi<^M---®MFrt T 


Alternate for -f, where F = Fi (g)M ■ ■ ■ ®m En- 


S: 


T,S-'^T 


S -Fi^mF^ T 


Special notation for n = 2. 
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High- 


Mid- 


Low-level 


Description 


Permutations of tensors and tensor fields. 






Right-action of permutations on n-tensors/n-tensor fields; 








(ui ® • • • ® v^Y ■- ® • ■ • ® «„-!(„); (A-^)" = .4''". 


Spaces of sections of bundles. 




Space of smooth sections of the bundle tt^ ; 




V{H) ■- [h e C°° (AT,//) 1 7r^o/i = Idiv}. 


(//), (^f ) 


Space of smooth sections of tt^ along <^; 




r,^ [H) ■- {h G (M, ff) 1 7r# o ft = 0}. 


Vertical bundle, puUback bundle, projection maps, pullback of sections. 








Vertical bundle over E ^ M;VE ■- keiTnf.j < TE. 
projection map tt]^^ := tte^ \ve- 




(l>*H 




Pullback bundle; (t>*H := {{m, h) € M x H \ <!) (m) = n (h)}. 
nfj ^ (m, h) := m; "'^ (m, h) := /i. 




4,*h 


Pullback of section h £ T {H); (f)* h £ T {<()* H) 








defined by p^*^ o rj,* h = h; h e T (H). 


Covariant derivatives; partial covariant derivatives. 


VL 


W L 




Natural linear covariant derivative; differential of functions; 
V"^*L := dL e r(r*M), where L € C°° (M,R). 


VX 


vx 

V^X 


V^X 


Linear covariant derivative on vector bundle E ^ M; 
V^X €T{E(g)M T*M), where X G P {E). 


V(?!> 






Tangent map as tensor field; 
V ^'^^(j) G r (8)Af T*M), where G C°° (M, X). 


Va 






Pullback covariant derivative; a G F {(j)* H); 
defined by V'^'"(j)*h = (j>*V"h -^-tn V "^^(ji; /i G P (//). 




i,ci , ■ • 




Partial differential of functions; 
L,c, e r(F*), defined by V = Ef^i i.-, -f. c,. 




X,ci, . . 




Partial linear covariant derivative; 
X G r (£ ®Af J;*), defined by V ^X = Er=i 'f-, c;. 




<f},Mi , . • 


■ > <^,A/„ 


Partial derivative decomposition of tangent map; 
<^,Af, G P (0*TX (g)Af pr* r*MO, 
where A/ = Mi x • • • x M„, pr, := prf^, and 

^ <P = Z^«=l V,Mi -pr* TA/i V • pr^. 


Covariant Hessians. 






Covariant Hessian of functions; 


W W L 




7T*Afy Af^R^ 


V^L G r(r*M®T*M); L G C°° (A/,R). 






7i5®T*A//yBj^ 


Covariant Hessian on vector bundle E — >■ Af; 








V^X G r (£ ® T*Af ® T'A//); X G P (£). 






Covariant Hessian of maps; 




Y <i>'TNig,MT'M Y M^N ^ 


£T{(j>*TN ®T*M ®T*M). (j> £ C°° (M,N). 


Derivative conventions. 






Directional derivative notation; Vxe ~ Ve -ta/ X. 


V"e (Xi (g) • • 


(g) X„_i ® X„) 


Iterated covariant derivative convention; 








defined by (Vx„ V^-^e) -""i (Xi ® • ■ • ® X„_i). 


i? (X, y) — -Vx Vy + Vy Vx + V[x,yi 


Curvature operator; R (X, F) e = V^e : (X (g) F - Y ® X). 




2 : M ^ M X J, m (m, 0) 


Evaluation-at-zero map. 




z*di 


= <5, 


Pullback formulation of derivative-at-zero. 



For more on relevant introductory theory on manifolds, bundles and Riemannian geometry, see [10], [S], 
[12], [9]. 
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Part I 

Mathematical Setting 

1 Using Strong Typing to Error- Check Calculations 

Linear algebra is an excellent setting for discussion of the strong typing [1] of a language, a concept used 
in the design of computer programming languages. The idea is that when the human-readable source code 
of a program is compiled (translated into machine-readable instructions), the compiler (the program which 
performs this translation) or runtime (the software which executes the code) verifies that the program objects 
are being used in a well-defined way, producing an error for each operation that is not well-defined. For 
example, a vector-type value would not be allowed to be added to a permutation-type value, even though 
tuples of unsigned integers (i.e. bytes) are used by the computer to represent both, and the computer's 
processing unit could add together their byte-valued representations. However, such an operation would be 
meaningless with respect to the types of the operands. The result of the operation would depend on the non- 
canonical choice of representation for each object. Strong type checking has the advantage of catching many 
programming errors, including most importantly those resulting from an inherent misuse of the program's 
objects. Within this paper, certain type-explicit notations will be used to provide forms of type awareness 
conducive to error-checking. 

An important example of semi-strong typing in math is Penrose's abstract index notation [ i i], modeled 
on Einstein's summation convention, in which linear algebra and tensor calculus are implemented using 
indexed objects (tensors) having a certain number and order of "up" and "down" indices (an abstraction of 
the genuine basis/coordinate expressions in which the indexed objects are arrays of scalars/functions) . A 
non-indexed tensor is a scalar value, a tensor having a single up or down index is a vector or covector value 
respectively, a tensor having an up and a down index is an endomorphism, and so forth. The tensors are 
contracted by pairing a certain number of up indices with the same number of down indices, resulting in an 
object having as indices the uncontracted indices. 

For example, given a finite-dimensional inner product space {V,g), where g is a (2) -tensor (having the 
form gij, i.e. two down indices), a vector v € V is Si (j'))-tensor, and the length of v is ^^v'^gijV^ . If dim V > 1, 
then /\^ V has positive dimension, its vectors each being (p)-tensors, and Gijke ■— Sikdji — gudjk is an inner 
product on /\^ V (which must be a (")-tensor in order to contract with two (g)-tensors). 

Certain type errors are detected by use of abstract index notation in the form of index mismatch. For 
example, with (V, g) as above, if a G V* , then a is a ('j')-tensor. Because of the repeated j down indices, the 
expression gijOj typically indicates a type error; gij can't contract with aj because of incompatible valence 
(valence being the number of up and down indices). Furthermore, multiplying a (2) -tensor with a ("j*) -tensor 
without contraction should result in a (g)-tensor, which should be denoted using three indices, as in gijak- 

The only explicit type information provided by abstract index notation is that of valence. The "semi" 
qualifier mentioned earlier is earned by the lack of distinction between the different spaces in which the 
tensors reside. For example, if U,V,W are finite-dimensional vector spaces, then linear maps A: U —?' V 
and B: V ^ W can be written as (J)-tensors, and their composition B o A: U ^ W is written as the 
tensor contraction {B o A)^ = B^A'j. However, while the expression A\B^ makes sense in terms of valence 
compatibility (i.e. grammatically), the composition "Ao_B" that it should represent is not well-defined. Thus 
this form of type error is not caught by abstract index notation, since the domains/codomains of the linear 
maps must be checked separately. 

The use of dimensional analysis (the abstract use of units such as kilograms, seconds, etc) in Physics is 
an important precedent of strong typing. Each quantity has an associated "dimension" (this is a different 
meaning from the "dimension" of linear algebra) which is expressed as a fraction of powers of formal symbols. 
The ordinary algebraic rules for fractions and formal symbols are used for the dimensional components, with 
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the further requirement that addition and equahty may only occur between quantities having the same 
dimension. 

For example, if E, M and C represent the dimensions of energy, mass and cost, respectively, and if the 
energy storage density pE/M of a battery manufacturing process is known (having dimensions energy per 
mass) and the manufacturing weight yield w M/C of the battery is known (having dimensions mass per cost) , 
then under the algebraic convensions of dimensional analysis, calculating the energy storage per cost (which 
should have dimensions energy per cost) is simple; 

/ E\ / M\ EM E 

(the M symbols cancel in the fraction). Here, both p and w are real numbers, and besides using the well- 
definedness of real multiplication, no type-checking is done in the expression pw. 

A contrasting example is the quantity p/w, having dimensions EC/M^. However, these dimensions may 
be considered to be meaningless in the given context. The quantity's type adds meaning to the real- valued 
quantity, and while the quantity is well-defined as a real number, the uselessness of the type may indicate 
that an error has been made in the calculations. For example, a type mismatch between the two sides of an 
equation is a strong indication of error. 

This is also a convenient way to think about the chain rule of calculus. If z {y), y (x), and x measure 
real-valued quantities, then z (y (x)) measures the quantity z with respect to quantity x. Using Z, Y and X 
for the dimensions of the quantities z, y and x respectively, the derivative has units Z/X. When worked 
out, the dimensions for the quantities on either side of the equation — will match exactly, having a 
non-coincidental similarity to the calculation in the battery product example. 

2 Telescoping Notation (aka Don't Fear the Verbosity) 

Many of the computations developed in this paper will appear to be overly pedantic, owing to the decoration- 
heavy notation that will be introduced in Section 3. This decoration is largely for the purpose of tracking the 
myriad of types in the type system and to assist the human reader or writer in making sense of and error- 
checking the expressions involved. The pedantry in this paper plays the role of introducing the technique. 
The notation is designed to telescope^, meaning that there is a spectrum of notational decoration; from 

• pedantically type-specified, verbose, and decoration-heavy, where [almost] no types must be inferred 
from context and there is little work or expertise required on the part of the reader, to 

• somewhat decorated but more compact, where the reader must do a little bit of thinking to infer some 
types, all the way to 

• tersely notated with minimal type decoration, where [almost] all types must be inferred from context 
and the reader must either do a lot of thinking or be relatively experienced. 

Additionally, some of the chosen symbols are meant to obey the same telescoping range of specifity. For 
example, compare n-fold tensor contraction with type-specified ■]/!<»■■■<»¥„ as discussed in Section 3, or the 
symbols V, V , and V as discussed in Section 10. Tersely notated computations can be seen in Section 10, 
while fully-verbose computations abound in the careful exposition of Part H. 

3 Strongly- Typed Linear Algebra via Tensor Products 

A fully strongly typed formulation of linear algebra will now be developed which enjoys a level of abstraction 
and flexibility similar to that of Penrose's abstract index notation. Emphasis will be placed on notational 

^Credit for the notion of telescoping notation is due in part to David DeConde, during one of many enjoyable and insightful 
conversations. 
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and conceptual regularity via a tensor formalism, coupled with a notion of "untangled" expression which 
exploits and notationally depicts the associativity of linear composition. 

If V denotes a finite-dimensional vector space, then let 

■v: V* xV -^R, {a,v) ^a{v) 

denote the natural pairing on V, and denote -y {a,v) using the infix notation a -y v. The natural pairing 
is a nondegenerate bilinear form and its bilinearity gives the expression a -y v multiplicative semantics 
(distributivity and commutativity with scalar multiplication) , thereby justifying the use of the infix • operator 
normally reserved for multiplication. The natural pairing subscript V is seemingly pedantic, but will prove 
to be an invaluable tool for articulating and navigating the rich type system of the linear algebraic and vector 
bundle constructions used in this paper. When clear from context, the subscript V may be omitted. 

Because V is finite-dimensional, it is reflexive (i.e. the canonical injection V y** ^ w i— > (a i— > a (v)) is 
a linear isomorphism). Thus the natural pairing -y on V* can be written naturally as 

■v-.V xV* ^ E, {v,a) ^a{v). 

Note that a -y v — v -v* d. Though subtle, the distinction between -y and -y- is important within the type 
system used in this paper. 

Through a universal mapping property of multilinear maps, the bilinear forms -y and -y* descend to the 
natural trace maps 

try : (g) V ^ M, a ® t; a (u) , and 
try. -.V ®V* ^R,v®a^ ol{v) , 

each extended linearly to non-simple tensors. These operations can also be called tensor contraction. 
Noting that (F* ® Y)* and {V ® V)* are canonically isomorphic ioV ®V* and V* ®V respectively, then 
for each A € V* ® V and B G V (S) V* , it follows that try (A) — Idy* -y^yA and try. (B) = Idy -y^y* B. 

Definition 3.1 (Linear maps as tensors). Let V and W be finite-dimensional vector spaces, and let 
Hom(y, ly) denote the space of vector space morphisms from V to W (i.e. linear maps). The linear 
isomorphism 

W(g)V* Hom {V, W) , 

w (8) a i-> {V ^ W, V i-^ w {a -y v)) 

(extended linearly to general tensors) will play a central conceptual role in the calculations employed in this 
paper, as it will facilitate constructions which would otherwise be awkward or difficult to express. Linear 
maps and appropriately typed tensor products will be identified via this isomorphism. 

Given bases vi, . . . ,Vm G V and wi, . . . ,Wn € W, and dual bases , . . . , e V* and w^, . . . , € W* , 
a linear map A: V ^ W can be written under the identification in (3.1) as 

A = A'jW^(E)v\ 

where A* = w"^ -w A -y vj £ M, and in fact [^*] e Mnxm (E) is the matrix representation of A with respect 
to the bases wi, . . . , Vm G V and wi, . . . , Wn G W , noting that the i and j indices denote the "output" and 
"input" components of A respectively. Tensors are therefore the strongly typed analog of matrices, where 
the W <^V* type information is carried by the Wi (8) component. 

One clarifying example of the tensor formulation is the adjoint operation of the natural pairing, also 
known as forming the dual of a linear map. It is straightforward to show that 

*:W(g)V* V* (g)W, 
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(where the map is extended linearly to general tensors) . This is literally the tensor abstraction of the matrix 
transpose operation; if A = Aj Wi (8) , then the dual A is A* = A^ a' (g) Wj. The matrix of A* is precisely 
the transpose of the matrix of A with respect to the relevant bases. The map * itself can be written as a 
4-tensor * e V* (E) W (E) W* (E) V , where A* = * -w^.v A. 

There is a notion of the natural pairing of tensor products, which implements composition and evaluation 
of linear maps, and can be thought of as a natural generalization of scalar multiplication in a field. If U, V, 
and W are each finite-dimensional vector spaces, then the bilinear form 

{U(g)V*)x {V(^W) U(^R(g)W ^U(g)W, 

{u® a,v ®w) {a -v v) ® w = {a -v v)u® w 

will be denoted also by the infix notation -y (i.e. (w a) -y (u «;) = (a w) w (g) w). IfV itself is a tensor 

product of n factors which are clear from context, then -y may be denoted by (think an n-fold tensor 
contraction). If n = 2, then typically : is used in place of For example, from above. A* = *-w®V' A = * : A. 

Given a permutation o" G 5„, define a right-action by cr : Vi E) ■ ■ • (E Vn — >■ ^^^-1(1) §5 • • • mapping 
elements in the obvious way. For example, (2 3 4) acting on vi<Ev2<S>V3E>V4 puts the second factor in the third 
position, the third factor in the fourth position, and the fourth factor in the second, giving t;i 1;4 U2 113. 
This permutation is itself a linear map and of course can be written as a tensor. However, because it is 
defined in terms of a right action, the "domain factors" will come on the left. Thus a is written as a tensor 
of the form Fj* • • • ^7-1(1) • • • (i.e. as a 2n-tensor). Certain tensor constructions are 

conducive to using such permutations. In the above example. * can be written as (1 2) € W* E)V (E)V* E) W. 

The permutation right-action also works naturally when notated using superscripts. For example, if 
B gU (giV (giW, then 

2) B -u-m'^w (1 2) e y [/ 

and so 

(2 3) 

^jj(i2)^ = [B -wisiV'^w (12)) -visiU'isiW' (2 3) 

= B -u'^v^w ((1 2) -v^U'tsw (23)) 

= B -u'^V'^w (1 2) (2 3) 

= B -u'^'^w (1 3 2) e y f/. 

When multiplying the permutations (1 2) and (2 3) in the third line, it is important to note that they are 
read left-to-right, since they are acting on B on the right. 

The inline cycle notation is somewhat ambiguous in isolation because the number of factors in the 
domain/codomain is not specified, let alone their types. This information can sometimes be inferred from 
context, such as from the natural pairing subscripts, as in the following examples. 

Example 3.2 (Linearizing the inversion map). Let i: GL{V) — > GL{V) , A A^^, i.e. the linear map 
inversion operator, where GL {V) is an open submanifold of V^V* via the isomorphism VE)V* = Horn (F, V). 
Its finearization (derivative) Di: GL {V) ^ V (g)V* (g) {V (g)V*y ^ V (g)V* (g)V* (g)V at A G GL {V) in the 
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direction B gTa [GL {V))^V®V* is 

Di {A) -v^v B = Di -y^y. 5{A + eB) 
= 5{i{A + eB)) 

= 6[{A + eB)-') 

= 5l{{l + eBA-^)A)~') 
= 6{A-^{l + eBA-^)-^) 



n=0 



{\—eBA ^|is taken arbitrarily small due to the derivative S := \e=o 
being evaluated in an arbitrarily small neighborhood of e = 0) 
= S{A-^-eA-^BA-^+0{e^)) 



■V 



B-vA-\ 



In order to "move" the B parameter out so that it plays the same syntactical role as in the original expression 

Di (A) ■ B. via adjacent natural pairing, sonic simple tensor manipulations can be done. The process is easily 
and accurately expressed via diagram. The following sequence of diagrams is a sequence of equalities. The 
diagram should be self-explanatory, but for reference, the number of boxes for a particular label denotes 
the rank of the tensor, with each box labeled with its type. The lines connecting various boxes are natural 
pairings, and the circles represent the unpaired "slots", which comprise the type of the resulting expression. 



Di{A) 



B 





V 


V* 


V* 


V 




V 


V* 















-A- 



O- V 



V* 



B 



V 



V* 



A- 



V 



V* 



-O 



The following step is nothing but moving the boxes for B out; the natural pairings still apply to the same 
slots, hence the cables dangling below. 



-A- 



A- 



B 



O- V 




In this setting, a tensor product amounts to flippantly gluing boxes together. 









®A- 


1 


B 




V 


V* 


V 


V* 


—o 


V 


V* 
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In order for B to be naturally paired in the same adjacent manner as in the original expression Di (A) ■ B, 
the slots of —A~^ (g) A^^ must be permuted; the second moves to the third, the third to the fourth, and the 
fourth to the second. 



-{A 


(K) 




(2 3 4) 


V 


V* 


V* 


V 



B 



V V* 



The first diagram equals the last one, thus Di {A) -v^v B = — (^A ^ A ^) -v^v* B, and by the 

nondegeneracy of the natural pairing on V <SiV*, this implies that Di (A) = - (A^^ (g) A^^)^^^''', noting 
that the statement of this expression does not require the direction vector B. The permutation exponent 
(2 34) can be calculated easily using simple tensors, if not by the above diagrammatic manipulations; 

(oi (g) 02) • (&i (8) 62) ■ (03 ® 0,4) = (ai (g) 04 ® a2 (8) as) : (5i <Si 62) = (oi <8) 02 ® 03 ® 04)'-^^''^ : (61 (g) 62) • 
Here, the expression (ai (Xi 02) • (61 (81 62) • (03 (81 04) represents the expression A^^ ■ B ■ A^^. 

The next example will later be extended to the setting of Riemannian manifolds and their metric tensor 
fields, and put to use to formulate what are known as harmonic maps (see (12.7)). But first, a new tensor 
operation must be defined. 

Definition 3.3 (Parallel tensor product). If U, V, W, X are vector spaces and A E U (E) V and B E W (E) X , 
then define their parallel tensor product A Kl B by 

AMB ■.^{A(g, B)^^^^ e {U (g)W)(g) {V (gi X) . 

The parentheses in the type specification are unnecessary, but hint at what the tensor decomposition for the 
quantity A^ B should be, if used as an operand to M again (see below). 

If A and B represent linear maps, then AM B £ {1/0 W) E) {V X) represents their tensor product as 
linear maps (the parentheses are unnecessary but hint at what the domain and codomain are, and for use of 
A Kl _B as an operand in another parallel tensor product), which is a "parallel" composition; if a G V^* and 
13 eX*, then {A M B) -y.^x- {a®[i) = {A -v a) (g> {B -x- /?). 

There is a slight ambiguity in the notation coming from a lack of specification on how the tensor product 
of the operands is decomposed in the case when there is more than one such decomposition. Notation 
explicitly resolving this ambiguity will not be needed in this paper as the relevant tensor product is usually 
clear from context. 

The parallel tensor product is associative; if Y and Z are also vector spaces and C eY ® Z , then 
{AMB)MC ^ AM{BMC) {U ®W ®Y)(E{V (EX(E Z) , 
allowing multiply-parallel tensor products. 

Example 3.4 (Tensor product of inner product spaces). If (V, g) and (W, h) are inner product spaces (noting 
that g E V* ® V* and h G W* ® W* are symmetric, i.e. literally invariant under (1 2)), then W ®V* is 
an inner product space having induced inner product k {A,B) := try -v ^* 'W h -yy B^ Here, the 
"inputs" of yl and B (the V* factors) are being paired using G V(S)V, while the "outputs" (the W factors) 
are being paired using h G W* (E) W* , and the trace is used to "complete the cycle" by plugging the output 
into the input, thereby producing a real number. The expression k {A, B) can be written in a more natural 
way, which takes advantage of the linear composition, as A : k : B (or, pedantically, A ■w(i)V k - w^v B), 
instead of the more common but awkward trace expression mentioned earlier. In the tensor formalism, the 
inner product k should have type W* (E)V E) W* (E V. Permuting the middle two components of the 4-tensor 
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h (g) g W* (g) W* (K) F (K) F gives the correct type. In fact, k = hM g~^. A further advantage to this 
formulation is that if any or all of A, fc, _B are functions, there is a clear product rule for derivatives of the 
expression A : k : B. This is something that is used critically in Riemannian geometry in the form of 
covariant derivatives of tensor fields (see (8.2)). 

In this paper, the main use of the tensor formulation of linear maps is twofold: to facilitate linear algebraic 
constructions which would otherwise be difficult or awkward (this includes the ability to express derivatives of 
[possibly vector or manifold- valued] maps without needing to "plug in" the derivative's directional argument), 
and to make clear the product-rule behavior of many important differentiable constructions. 



4 Bundle Constructions 

In order to use the calculus of variations involving Lagrangians depending tangent maps of maps between 
smooth manifolds, it suffices to consider Lagrangians defined on smooth vector bundle morphisms. Con- 
tinuing in the style of the previous section, a "full" tensor product of smooth vector bundles (4.4) will be 
formulated which will then allow expression of smooth vector bundle morphisms as tensor fields, sometimes 
called two-point tensor fields [1 ' , pg. 70]. The full arsenal of tensor calculus can then be used to considerable 
advantage. 

First, some definitions and simpler bundle constructions will be introduced. A smooth [fiber] bundle 
(hereafter refered to simply as a smooth bundle) is a 4-tuple {£,E,Tr,N) where £, E and N are smooth 
manifolds and tt: E N is locally trivial, i.e. N is covered by open sets {C/q} such that tt^^ {Ua) = x £ 
as smooth manifolds. The manifolds £, E and N are called the typical fiber, the total space, and the 
base space respectively. The map tt is called the bundle projection. The full 4-tuple specifying a bundle 
can be recovered from the bundle projection map, so a locally trivial smooth map can be said to define a 
smooth bundle. The dimension of the typical fiber of a bundle will be called its rank, and will be denoted 
by rankTT or rank£^ when the bundle is understood from context. 

The space of smooth sections of a smooth bundle defined by tt : i? — > iV is 

r (tt) {a e C°° (iV, £;) I TT o a = Wat} , 

and may also be denoted by F {E), if the bundle is clear from context. If nonempty, F (tt) is generally an 
infinite-dimensional manifold (the exception being when the base space N is finite). 

Proposition 4.1 (Trivial bundle). Let M and N he smooth manifolds. With M ^ N := M x N and 

^M^N :=prfxiV: M » iV ^ TV 

defines a smooth bundle (^M, M » N, ti^'^^^ ^ jV^ , called a trivial bundle. Similarly, with M N := M x N 
and TT^i^N ppMxAr ■ M¥rN ^ M, (^N, M ^ N, n^^N ^ ^ ^^^^^^i i^^^dle. 

No proof is deemed necessary for (4.1), as each bundle projection trivializes globally in the obvious way. 
The ^ symbol is a composite of x (indicating direct product) and ^ or <- (indicating the base space). 
If M and N are smooth manifolds as in (4.1), then there are two particularly useful natural identifications. 

C°° (Af , N)^T {M ^ N) C°° (Af, TV) = F (A^ ^ M) 

Mm xm</' 0i-^-0XMldM 
prf><^o$^$ prf><*^o$^$ 

These identifications can be thought of identifying a map e C°° (Af , A^) with its graph in Af x A and 
A X M respectively. Furthermore, this allows bundle theory to be applied to reasoning about spaces of 
maps. The symbols M ¥: N and A^ » A/ now carry a significant amount of meaning. Generally N M 
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will be used in this paper, for consistency with the Horn (V, W) = W ^V* convention discussed in Section 
3. The symbols ^ and ^ are examples of telescoping notation, as they are built notationally on x , and 
conceptually on the direct product, which is what is denoted by x. The arrow portion of the symbols can 
be discarded when type-specificity is not needed. 

Proposition 4.2 (Direct product bundle). Let (^S,E,n^,M) and (^J^ , F, , N) be smooth bundles. Then 

TT^ X TT^ : E X F ^ M X N, (e, /) ^ (tt^ (e) , tt^ (/)) 

defines a smooth bundle [£ x F , E x F, x , M x N) . This bundle is called the direct product of 
and TT^ , and is not necessarily a trivial bundle. 

Proof. Let : (tt^)"^ (U) ^ U x £ and : (vr^)"^ {V) ^ V x F trivialize and tt^ over open sets 
U C M and V Q N respectively. Then 

4-^ X (tt^)"^ ([/) X (tt^)"^ iV)^Ux£xVxF 

has inverse (*^) X {^^) . Note that 

(tt^)"' ([/) X {n^y' (V) = {{e,f)eExF\7r^ (e) e U, (/) G V} 

^{{e,f)eExF \ (tt^xtt^) {e, f) e U x V} 
= {tt^ X TT^y' {U X V) , 

and that 

P: {U X £) X {V X F) ^ {U xV) X {£ X F) , {{u, e) , (v, /)) ^ ((u, v) , (e, /)) 
defines a diffeomorphism. Then 

xi,Ey:F ._p^ ^ . ^^E ^ ^Fyi (^u xV) ^ (U xV) X {£ X F) 
defines a diffeomorphism, and 

= pr(t^x^)x(^x^) oP o (vl/^ X (e, /) 
= pr(t'x^)x(^x^) oP {^^ (e) , vl/^ (/)) 

= prpx^)x(^ x^) oF ((pr^x^ o^^ (e) , pr2^><^ ov]/^ (e)) , (pr^x^ o*^ (/) , pr2^x^ o*^ (/))) 
= prpx^)x(^x^) ((pr^^ ovl,^ (e) ,prr><^o*^ (/)) , (pr^^^^ o^>^ {e) o^^ (/))) 

= (prfx£o*^(e),prp^o*^(/)) 
= (^^(e),^^(/)) 
= (^^x^^)(e,/), 

showing that vji-^xF trivializes tt^ x tt^ over J7 x 1/ C MxN. Since A/ x can be covered by such trivializing 
sets, this establishes that x tt^ defines a smooth bundle. The typical fiber of tt^ x tt^ is £ x F. □ 

A smooth vector bundle is a fiber bundle whose typical fiber is a vector space and whose local 
trivializations are linear isomorphisms when restricted to each fiber. If {£,E,tt,M) is a smooth vector 
bundle, then its dual vector bundle {£* , E* ,7t* , M) is a smooth vector bundle defined in the following 
way. 

E* U (Ep)* , Tr*:E*^ Af, r,p ^ p. 
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Because f is a vector space, the notation 8* is already defined. In analogy with Section 3, there are natural 
pairings on a vector bundle and its dual, defined simply by evaluation. If p S M, 77 S E* and e Ep, then 
r] ■ E e ■— f] ' Ep e and e ■ e V '■— ^ ' Ej, "H- Both expressions evaluate to 77 (e). Natural traces and rt-fold tensor 
contraction can be defined analogously. Again, while seemingly pedantic, the subscripted natural pairing 
notation will prove to be a valuable tool in articulating and error-checking calculations involving vector 
bundles. To generalize the rest of Section 3 will require the definition of additional structures. 

For the remainder of this section, let (^S,E,n^,M) and ( J^, F, tt^ , A^) now be smooth vector bundles. 
The following construction is essentially an alternate notation for x tt^ : E x F ^ M x N, but is one 
that takes advantage of the fact that tt^ and tt^ are vector bundles, and encodes in the notation the fact 
that the resulting construction is also a vector bundle. This is analogous to how V x W is a vector space 
with a natural structure if V and W are vector spaces, except that this is usually denoted hy V (B W. 

Proposition 4.3 ("Full" direct sum vector bundle). // 

E®mxnF:= ExF, 

Then 

71"^ ®MxN TT^ := XTT^: E®mxN F M x N 

defines a smooth vector bundle Q) J-, E (BmxN E, tt^ ®mxN '^^ , M x iV) , called the full direct sum of 
TT^ and TT^ . 

For each {p.q) ^ M x N, the vector space structure on (tt-^ ®mxN t^^) {PtQ) given in the following 
way. Let a eR and {ei, fi) ,{e2, f2) ^ {'^^ ®MxN T^'') ^ {p,q)- Then 

a (ei,/i) + (62, /2) = (aei + 62,0/1 + f2) ■ 

It is critical to see (4-5) for remarks on notation. 

Proof Let U, V, £, T, P, l-^, and ^^^-^^ be as in the proof of (4.2), and define ^^|ES>M^NF _ ^bxf^ 
Noting that \l/^®MxivF smooth bundle isomorphism over Iduxv, so to show that x^/^^mxnF jg ,^ linear 
isomorphism in each fiber, it suffices to show that it is linear in each fiber. Let a G M, (p,q) Q U x V and 
(ei,/i),(e2,/2) e {n''®MxN7r^y\p,q)- Then 

^EeM.«F (^g^ ^ + /2) - P o (*^ X (aei + 62, a/i + /a) 

- P (^-^ (aei + 62) , (a/i + /2)) 
= P (a vf^ (ei) + vl^^ (62) , a (/i) + {f^)) 

(by trivial vector bundle structures on U x fand V x F) 
= a P (ei) , (/i)) + P (62) , (/2)) 

(by trivial vector bundle structure on {U x V) x {£ x F)) 
= aPo X (ei,/i) + Po (*^ X (62, /2) 
= a ^H^^^m^nF _^ ^seMxivf (g2, /2) . 

Thus \^^®mxnF jg lijigg^j. each fiber, and because it is invertible, it is a linear isomorphism in each fiber. 
In particular, \I>^®AfxNF jg smooth vector bundle isomorphism over IduxV- Applying (^IiE^m^nF^ 
the above equation gives 

(aei + e2,a/i + /2) = a (ei, /i) + (e2,/2) , 
as desired. □ 

This construction differs from the Whitney sum of two vector bundles, as the base spaces of the bundles are 
kept separate, and aren't even required to be the same. This allows the identification of T (M x N) — > MxN 
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as TM (Bmxn TN M x N, which may be done without comment later in this paper. Some important 
related structures are pr^ nj/'' : pr^ TM -J> M x iV and prj tt^^ : pr^ TN M x N, where pr, := prf^""^. 

The next construction is what will be used in the implementation of smooth vector bundle morphisms 
as tensor fields. 

Proposition 4.4 ("Full" tensor product bundle). // 

E <8)MxAf F '■= Ep (g) Fq (disjoint union), 

{p,q)eMxN 

Then 

TT^ (Smxn tt^ : E (^MxN F^ M X N, a^^Ci ® fj ^ (tt^ (d) , tt^ (/i)) (here, a'^ € R) 

defines a smooth vector bundle (^8 ® F,E ®mxN F, tt^ <E)mxN tt^, M x , called the full tensor product^ 
of TT^ and TT^ . 

It is critical to see (4-5) for remarks on notation. 

Proof. Since the argument a'-'ci (X) fj in the definition of tt^ <E)mxn is not necessarily unique, the well- 
definedness of tt^ <E)mxn tt^ must be shown. Let a^^e] (E) fj — P^-'ef ® /J. Then in particular, d^^e} ® 
fj,f3^^ef (g) fj & Ep® Fq for some (p, q) £ M x N , and therefore e^, ef € Ep and fj, fj € Fq for each index 
i and j. Thus tt^ (e}) = p = tt^ (ef) and tt^ (^fl) — q = (/i)' the expression defining tt^ ®mxn 
is well-defined. 

The set E ®mxn F does not have an a priori global smooth manifold structure, as it is defined as the 
disjoint union of vector spaces. A smooth manifold structure compatible with that of the constituent vector 
spaces will now be defined. 

Let : (tt^) {U)^Ux8 and : (tt^) {V)^VxF triviahze -k^ and tt^ over open sets U QM 
and V Q N respectively, such that and are each linear in each fiber. Define 

{t:^ ®MxN^^y\u xV) ^ {UxV)x{£®T) 

X ^ {{T:^®MxN^n{X),{{w'i'''o^^)®{j,rl^^o-^P)){X)). 

The map jg well-defined and smooth in each fiber by construction, since for each {p,q) £U xV, 

(pr^^^ o*^) ® (pr^^-^ o*^) \e^^e, : Ep®Eq^ E®F 

is a linear isomorphism by construction. Additionally, -^^^m^nF j^g^g ]-,ggjj constructed so that 

p^(C/xy)x(£«Mx«^) „^iJ»Mx«J^ ^ ^^^^^ 

on (tt^ (^mxn tt^) ""^ (t/ X F). Define the smooth structure on (tt^ ^AfxAf tt^) (t/ x C (^mxn F 
by declaring vJ/^^'^mxatF j^g diffeomorphism. The map tt^ <E)mxn is trivialized over U x V. The set 
'S>MxN F can be covered by such trivializing open sets. Thus E ®)mxn F has been shown to be locally 
diffeomorphic to the direct product of smooth manifolds, and therefore it has been shown to be a smooth 
manifold. With respect to the smooth structure on E (E)mxn F, the map tt^ 'Eimxn is smooth, and has 
therefore been shown to define a smooth vector bundle. □ 

Remark 4.5 (Notation regarding base space). The "full" direct sum (4.3) and "full" tensor product (4.4) 
bundle constructions allow direct sums and tensor products to be taken of vector bundles when the base 
spaces differ. If the base spaces are the same, then the construction "joins" them, producing a vector bundle 
over that shared base space. For example, if E and F are vector bundles over M , then E®mxm F has base 
space M x M, while E ® F has base space M . The base space can be specified in either case as a notational 
aide; the latter example would be written as E ®m F. If no subscript is provided on the ® symbol, then 

^This construction is alluded to in [~, pg. 121], but is not defined or discussed. 
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the base spaces are "joined" if possible (if they are the same space), otherwise they are kept separate, as in 
the "full" tensor product construction. This notational convention conforms to the standard Whitney sum 
and tensor product bundle notation, and uses the notion of telescoping notation to provide more specificity 
when necessary. 

Given a fiber bundle, a natural vector bundle can be constructed "on top" of it, essentially quantifying 
the variations of bundle elements along each fiber. This is known as the vertical bundle, and it plays a 
critical role in the development of Ehresmann connections, which provide the "horizontal complement" to 
the vertical bundle. 

Proposition 4.6 (Vertical bundle). Let tt^ : E ^ M define a smooth [fiber] bundle. IfVE := kcrTn^ < 
TE, then tt^^ :— tt^^ \ve- — > E defines a smooth vector bundle subbundle of tt^^ : TE — > E, called 
the vertical bundle over E. Furthermore, the fiber over e Cz E is VeE = TeE^^Ei^f,-^ < T^E. 

Proof. Because ir^ is a smooth surjective submersion, VE — is a subbundle of TE E having corank 
dim M and therefore rank equal to that of E. Furthermore, if e G E and e i— )■ € E^e^^^-^ , then Se^ represents 
an arbitrary element of TgE^Ef^^^, and Ttt^ (^e^) = 5 (tt^ (bc)) — S {n (e)) = 0, showing that Se^ S kerT-Tr^, 
and therefore that Je^ € VeE. This shows that T^E^Ef^^) ^ VeE. Because dimTeE^Ef^e) — rank£', this shows 
that TeE^Ei^e) = VeE. □ 

Given the extra structure that a vector bundle provides over a [fiber] bundle, there is a canonical smooth 
vector bundle isomorphism which adds significant value to the puUback bundle formalism used throughout 
this paper. This can be seen put to greatest use in Part II, for example, in development of the first variation 
(see (12.1)). 

Proposition 4.7 (Vertical bundle as pullback). //tt: _E — >■ M defines a smooth vector bundle, then 

L'fi:TT*E VE, 

{x, y) t-^ 6^{x + ey) 

is a smooth vector bundle isomorphism over Id^, called the vertical lift, having inverse 

VE X . f V 
L^.E-.de^ ^ [eo,hm — — 1, 

where, without loss of generality, is an E-valued variation which lies entirely in a single fiber. 

Proof. It is clear that Ly^ is linear and injective on each fiber. By a dimension counting argument, it is 
therefore an isomorphism on each fiber. Because it preserves the basepoint, it is a vector bundle isomorphism 
over Ids. Because the map {x,y,e) i— >■ x + ey is smooth, so is the defining expression for Ly^, thereby 
establishing smoothness. That i]^,^ inverts Ly£' is a trivial calculation. □ 

5 Strongly- Typed Tensor Field Operations 

Because vector bundles and the related operations can be thought of conceptually as "sheaves of linear al- 
gebra", the constructions in Section 3, generalized earlier in this section, can be further generalized to the 
setting of sections of vector bundles. 

If E, F,G are smooth vector bundles over M, then define the natural pairing of a tensor field with a 
vector: 

■F:T{E(g)M F*) X F E, 

(e®M0,/) ^ e (tt^ (/)) [0 (^^ (/)) /] , 
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extending linearly to general tensor fields. Further, define the natural pairing of tensor fields: 

■p:r{E<S)MF*)xT{F<»MG) ^ r{E<S)MG), 

{e®M (t>J ®M g) ^ {p^ (i{p)®M {(t>{p) -F,, f {p)) ®M 9{p)) 
= {p^ {(j) (p) -F^ f (p)) (e (8)M g) {p)) , 

extending linearly to general tensor fields. This multiple use of the • p symbol is a concept known as opera- 
tor overloading in computer programming. No ambiguity is caused by this overloading, as the particular 
use can be inferred from the types of the operands. As before, the subscript F may be optionally omitted 
when clear from context. 

The permutations defined in Section 3 are generalized as tensor fields. If Fi, . . . , Fn are smooth vector 
bundles over M , and ct G S'„ is a permutation, then a can act on -Fi (8)m ■ ■ • ®m Fn by permuting its factors, 
and therefore can be identified with a tensor field 

(T e r {F^ (^M ■ ■ ■ <^M K (g)Af F^-i(i) <»M ■ ■ ■ ®M -F<T-i(„)) 

defined by 

(/l ®M ■ ■ ■ ®M fn) ■F*(g,M-0MF* Cr := /o-i(l) ®M ' ' ' ®M /<T-i(n)- 

An important feature of such permutation tensor fields is that they are parallel with respect to covariant 
derivatives on the factors Fi, . . . , F„ (see (8.12) for more on this). 

6 Fullback Bundles 

The puUback bundle, defined below, is a crucial building block for many important bundle constructions, as 
it enriches the type system dramatically, and allows the tensor formulation of linear algebra to be extended 
to the vector bundle setting. In particular, the abstract, global formulation of the space of smooth vector 
bundle morphisms over a map 0: M — iV is achieved quite cleanly using a pullback bundle. Furthermore, 
the use of pullback bundles and pullback covariant derivatives simplifies what would otherwise be local co- 
ordinate calculations, thereby giving more insight into the geometric structure of the problem. 

For the duration of this section, let {F, F, tt, N) be a smooth bundle having rank r. 
Proposition 6.1 (Pullback bundle). Let M and N he smooth manifolds and let (f): M ^ N be smooth. If 

rF := {(m, /) e Af X F I (m) = TT (/)} , 

and 

then (J^, (/)*F, TT*^ ^, Af) defines a smooth bundle. In particular, (p* F is a smooth manifold having dimension 
dim M + rankTT. The bundle defined by tt'^ ^ is called the pullback of tt by <j). 

Proof. Recalling that F denotes the typical fiber of tt, let ^f: tt^^ ([/) U x F trivialize tt over open set 
U CN. Define 

: 0* {tt-' ([/)) ^ [U] X F, {m, f) ^ (m, pr^^^^-^ ovj/ (/)) 

and 

: iU)xF^ 0* {tt-' {U)) , (m, /) ^ (m, (0 (m) , /)) . 

Claim (1): and 'i''^' are smooth. Proof: 0* {tt-' {U)) C (j)-' [U) x tt-' {U), and is clearly smooth 
as a map defined on the larger manifold. Therefore it restricts to a smooth map on 0* {tt-' (U)). An 
analogous argument shows that 'i'T' is smooth. Claim (1) proved. 
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Claim (2): inverts Proof: Let (m, /) e (jf (tt"! {U)). Then 

o *0 (m, /) = vp-i (m, pr2^><^ ov|/ (/)) 

= (m,*-i(0(m),pr2^>^^o*(/))) 

= (m,*-i(7r(/),pr^x^o*(/))) (since (m) = tt (/)) 

= (m, (prf (/) , pr^^x^ (/))) 

= (m,f-io*(/)) 

= (m,/). 

With 5 e J", 

*0 o (m, 5) = (m, (</. (m) , g)) 

= {m,v4''^ oiH-^ {cl>{m) ,g)) 

= {m, g) , 

proving Claim (2). 

Claim (3): trivializes tt'^*'^ over {U) C M. Proof: Let (m,/) e (tt"! (C/)). Then 

p,^-(^)x^ o*, (m, /) = prf'^^^^^ o (m, pr.^^^ „^ (^)) = ^ = ^^'f ^ 

and by claims (1) and (2), is a difFeomorphism, so trivializes tt*^ ^ over (/>~^ ([/) C M. Claim (3) 
proved. 

Since M can be covered with sets as in claim (3) and since the typical fiber of tt"^ ^ is diffeomorphic to T, 
this shows that n'^ ^ defines a smooth bundle (j^ , (j)* F, tt"^ ^ Because (j)*F is locally diffeomorphic to 

the product of an open subset of M with <p*F has been shown to be a smooth manifold having dimension 
dimM + dim J" = dimM + rankTT. □ 

While the puUback bundle is constructed as a submanifold of a direct product, there is a natural bundle 
morphism into the pulled-back bundle, which serves as an interface to maps defined on the pulled-back bundle. 
Usually this morphism is notationally suppressed, just as naturally isomorphic spaces can be identified 
without explicit notation. 

Corollary 6.2 (PuUback fiber projection bundle morphism). If cf): M ^ N is smooth, then 

p^^-.^F ^ F, 
(m, f) ^ f 

is a smooth bundle morphism over (p which is an isomorphism when restricted to any fiber of cf)*F. 

Because ^ is the projection pr^^^ |0*f, its tangent map is also just the projection pr^^®"^^ |t0*f- 
Proposition 6.3 (Bundle puUback is a contravariant functor). The map of categories 

Fullback: Manifold {Bundle{M) \ M e Manifold} , 

M Bundle (M) , 

{(p-.M^N) ^ (^Bundle (N)^ Bundle (M), {J^, F, TT, N) ^ (^J^,(j)*F,TT''''^,M^^ 

is a contravariant functor. Here, naturally isomorphic bundles in Bundle{M), for each manifold M, are 
identified (along with the corresponding morphisms). 
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Proof. Noting that 

lA*^F^{[n,J)^N xF\ Idjv {n) = ix (/)} - F 

and that 



(Id^ vr) (n, /) = (prf |m;, f) {nj)^n^7r (/) 
=^ Id^ TT = vr, 

it follows that Fullback (Id jv) = I'-'^Bundle(JV) ~ ^'-^Fullback(Af)' PuUback satisfies the identity axiom of 
functoriality. 

For the contravariance axiom, let (p: M N and tJj: L M he smooth manifold morphisms and let 
{J-, F, TT, N) be a smooth bundle. Then 

^p*(f>*F ^ \^{£,p) e Lx <j>*F {£) = n'f''^ (p)} 

= {{i, {mJ))eLx [M xF)\^{£) = tt^*^ (m, /) and ^ (m) = tt (/)} 

= {{i, (m, f))(zLx [M xF)\iP{i)^ m and (m) = tt (/)} 

-{(^,/)eLxf^|0oV;W-7r(/)} 

= {(t>o^:)* F 

and 

^'^***^ (^, (m, /)) - (prf 1^.^.;^) (^, (m, /)) = £ and 

showing that ■k'^''*'''' ^ 7r(0°V')*-F^ and therefore 

Fullback {tp) o Fullback (cj)) = Fullback (0 o i/') , 
establishing Fullback as a contravariant functor. □ 
The space of sections of a puUback bundle is easily quantified. 

F {(b*F) = {<T e (Af, (j)*F) I ^"^'^ o a = Mm} ■ 

This space will be central in the theory developed in the rest of this paper. Furthermore, it is naturally 
identified with the space of sections along the puUback map; 

(F) := {S e (M, F) I o S = 0} . 

These spaces are naturally isomorphic to one another, and therefore an identification can be made when 
convenient. While the former space is more correct from a strongly typed standpoint, the latter space is a 
convenient and intuitive representational form. The particular correspondence depends heavily on the fact 
that (j)*F is a, submanifold of M x F. 

T{c^*F) = r^{F) 

a ^ pr^'^^'oa. 

Furthermore, if / G F (F), then / o (/) e F^ {F). Note that it is not true that any a G F^ [F) can be written 
as / o (/) for some f £ T (F), for example when there exists some distinct p,q G M such that (p) = (j) (q) 
and a (p) ^ a (q). Furthermore, the representation / o is generally non-unique, for example when (p is not 
surjective, sections /i, /2 G F (F) which differ only away from the image of (f> will still give fi o (p = f2 ° (p- 
Before developing the notion of a linear connection on a puUback bundle, it will be necessary to address 
these features which, while inconvenient, provide the strength of the puUback bundle and puUback covariant 
derivative (see (8.8)). 
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Lemma 6.4 (Local representation of (F) elements). Recall that r denotes the rank of smooth bundle 
F. If a Cz (F) then each point p Cz M has some neighborhood U in which a can be written locally as 
\u= fi° <P \u, where fi,...,fr e T (i^ \4>(u)) is a frame for F \^(u), and (t\ . . . ,(t'~ G C°° (C/,M) are 
defined by a' = (/' o (p -p a \u- 

Proof. Let p e M, let y C be a neighborhood of (p) over which F \v is trivial, and let U — cj)^^ (V), 
so that L/ is a neighborhood of p. Let fi,...,fr G T {F \v) be a frame for F \v (i.e. F \^(u)); ^^i^d let 
...,/'" G r ((F \v)*) be the corresponding coframe (i.e. the unique . . . , such that /* -f fj — for 
each ij). Define a' G C°° (M,R) by cr* = {f' o cf) \u) -p a \u. Then 

= {{f^ o (f> \u) (^U (/'■ o Ic/)) -F <J \u 
= {{f^(S)V f')°(f'\u) -FCr It/ 

= (WfIv, o</> \u) -f ct |c/ 
= o- Ic/, 

as desired. □ 

Some literature uses expressions of the form / o g {F) along with an implicit use of the section- 
identifying isomorphism to write down particular sections of puUback bundles. In most cases, this tacit 
identification of spaces is harmless, but certain highly involved calculations may suffer from it. The section 
that f o (f) corresponds to under said isomorphism is Mm 'Xm {f o(f>) S r{(f>*F). However, because this 
expression is unwieldy and therefore a more compact and contextually meaningful expression is called for. 

Definition 6.5 (PuUback section). If / G F (F) and (j>: M N is smooth, then define 

rf ■■=ldMXM{fo(j)) eriq}*F). 
This is known pullback section. 

The pullback section is deservedly named, li (f): M N and ip: L ^ M are smooth, then ijj*4>*f ^ 
{</) o jp)* f in the sense of the proof of (6.3). 

Proposition 6.6 (Bundle pullback commutes with tensor product). If E and F are smooth vector bundles 
over manifold N and (p: M N is smooth, then the map 

(p*E®M<t>*F (P*{E(E)nF), 
(m, e) ®M {m, f) ^ (m, e (S)n /) 

(extended linearly to general tensors) is a smooth vector bundle isomorphism. 

Proof. Let c denote the above map. The well-definedness of c comes from the universal mapping property on 
multilinear forms which induces a linear map on a corresponding tensor product. If c ((m, e) (to, /)) = 0, 
then e (E)n f — 0, which implies that e = or / = 0, and therefore that (m, e) {fn,f) = 0. Because 
there exists a basis for {(p*E ®m 4>*F)m consisting only of simple tensors, this implies that c is injective, and 
by a dimensionality argument, that c is an isomorphism. The map is clearly smooth and respects the fiber 
structures of its domain and codomain. Thus c is a smooth vector bundle isomorphism. □ 

The contravariance of pullback and its naturality with respect to tensor product are two essential prop- 
erties which provide some of the flexibility and precision of the strongly typed tensor formalism described in 
this paper. This will become quite apparent in Part II. 
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Remark 6.7 (Tensor field formulation of smooth vector bundle morphisms). A particularly useful application 
of puUback bundles is in forming a rich type system for smooth vector bundle morphisms. This approach was 
inspired by [ , pg. 11]. Let tt^ : E ^ M and tt^ : F ^ N he smooth vector bundles, and let (p: AI N he 
smooth. Consider Hom^ {E, F), i.e. the space of smooth vector bundle morphisms over the map (j>. There is 
a natural identification with another space which lets the base map play a more direct role in the space's 
type. In particular, 

Hom^(£;,F) - RoraM,AE,<P*F), 
prf^^^oB ^ B. 

This particular identification of smooth vector bundle morphisms over </> can now be directly translated into 
the tensor field formalism, analogously to (3.1). 

Ti(l>*F(g,ME*) ^ Homid,, (^,0*F), 
A ^ {ei-^ A-E e) . 

The inverse image of _B e Homidjvj {E, 4>* F) is given locally; let (e^) and (/J denote local frames for E and 
F in neighborhoods U C M and V N respectively, with ([/) C V, and let (e*) and (/*) denote their 
dual coframes. Then the tensor field corresponding to B is given locally in U by Bj(f>*fi (E}m , where 
Sj := (l)*dr oBoej&C°° {U, M). 

Quantifying smooth vector bundle morphisms as the tensor fields lends itself naturally to doing calculus 
on vector and tensor bundles, as the relevant derivatives (covariant derivatives) take the form of tensor fields. 
The type information for a particular vector bundle morphism is encoded in the relevant tensor bundle. 

7 Tangent Map as a Tensor Field 

This section deals specifically with the tangent map operator by using concepts from Section 5 and Section 
6 to place it in a strongly typed setting and to prepare to unify a few seemingly disparate concepts and 
notation for some tangible benefit (in particular, see Section 10). 

Given a smooth map M — > N , its tangent map T0: TM — > TN is a smooth vector bundle morphism 
over (/), so by (6.7), is naturally identified with a tensor field 

V e r {(j3*TN ®M T*M) , 

which may be denoted by V where type pedantry is deemed unnecessary. This construction is known as a 
two-point tensor field [ , pg. 70]. The inscribed o symbol in V is used to denote that this is a nonlinear 
derivative, thereby distinguishing it from a linear covariant derivative. 

Remark 7.1 (Generalized covariant derivative). The well-known one-to-one correspondence between linear 
connections and linear covariant derivatives [ , pg. 520] generalizes to a one-to-one correspondence between 
Ehresmann connections and a generalized notion of covariant derivative. To give a partial definition for the 
purposes of utility, a generalized covariant derivative on a smooth ]fiber] bundle — t' is a map V on 
r (F) such that Vcr G T {a*VF®N T* N) for each a (^T{F). The space of maps (M, N) is naturally 
identified as r(7V^M), and there is a natural Ehresmann connection on the bundle N ^ M, whose cor- 
responding covariant derivative is the tangent map operator. This is the subject of another of the author's 
papers and will not be discussed here further. This is mentioned here to incorporate linear covariant deriva- 
tives (to be introduced and discussed in Section 8) and the tangent map operator (a nonlinear covariant 
derivative) under the single category "covariant derivative". 
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There is a subtle issue regarding construction of the cotangent map of (j) which is handled easily by the 
tensor field construction. In particular, while the cotangent map T*(j) is the pointwise adjoint of the tangent 
map Tcj), i.e. for each p € M, Tpc/j: TpM -J> T^{p)N is linear and T*4>: T*^p)N -J> T*M is the adjoint of Tpc/j, 
it does not follow that T*(j)e Hom {T*N, T*M), being some sort of "total adjoint" of T(p E Horn (TM, TN). 
The obstruction is due to the fact that (f) may not be surjective, so there may be some fiber T*N that is not of 
the form T'^^^^N , and therefore the domain could not be all of T* N. Furthermore, even if were surjective, 
if it weren't also injective, say (/i(po) = 4'iPi) for some distinct pQ,pi G M, then T^(j,g)N = '^4>{pi)^^ ™^ 
TpgM ^ Tp^M, so the action on the fiber T'^(^^)N is not well-defined. 

In the tensor field parlance, the cotangent map T*<j) simply takes the form 

(V0)(^2^ £T{T*M®M(ty*TN). 

The permutation superscript (1 2) is used here instead of * to distinguish it notationally from puUback nota- 
tion, which will be necessary in later calculations. The key concept is that the tensor field ( V0) encodes 
the base map 0; the basepoint p € Af is part of the domain (j>*T*N itself. 

The chain rule in the tensor field formalism makes use of the bundle puUback. If -0 : i — > A/ is smooth, 
then 

Because V t/j G T {i/j*TM (g)^, T*L), to form a well-defined natural pairing, the use of the puUback 

V-* V </> e r (V'* {(t)*TN ®M T*M)) = r {i:*(j)*TN (E)L i)*T*M) = T o V)* TN i^*T*M) 
is necessary (instead of just V G T {(j)*TN (8)m T*M)). 

Sometimes it is useful to discard some type information and write V (E T^XMidAf {TN ^nxm T*M), 
i.e. V (t): M ^ TN i^nxm T*M such that {irj/^ ®n-km t^Ii'''') o V cj) ^ (/) Xm Wm- This is easily 
done by the canonical fiber projeciton available to all pullback bundle constructions; (j)*TN (E)m T*M = 
{4> Xjv/ Wm)* {TN (E)NxM T*M), and the canonical fiber projection is 

/^fe^xMr-If^""'^*''' ■■ Wm)* {TN (g>^^M T*M) TN ®mxm T*M, 

as defined in (6.2). The granularity of the type system should reflect the weight of the calculations being 
performed. For demonstration of contrasting situations, see the discussion at the beginning of Section 8 and 
the computation of the first variation in (12.1). 

It is important to have notation which makes the distinction between the smooth vector bundle morphism 
formalism and the tensor field formalism, because it may sometimes be necessary to mix the two, though 
this paper will not need this. An added benefit to the tensor field formulation of tangent maps is that certain 
notions regarding derivatives can be conceptually and notationally combined, for example in Section 10. 



8 Linear Covariant Derivatives 

As will be shown in the following discussion, a linear covariant derivative (commonly referred to in the 
standard literature without the "linear" qualifier) provides a way to generalize the notion in elementary cal- 
culus of the differential of a vector-valued function. The linear covariant derivative interacts naturally with 
the notion of the pullback bundle, and this interaction leads naturally to what could be called a covariant 
derivative chain rule, which provides a crucial tool for the tensor calculus computations seen later. 

Let V and W be finite-dimensional vector spaces let U C V be open, and let 0: U W he differentiable. 
Recall from elementary calculus the differential Dcf): U W ® V* (essentially matrix- valued) . There 
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is no base map information encoded in D(f) (i.e. </> can't be recovered from D(f) alone), it contains only 
derivative information. The vector space structure of V and W allows the trivializations TU = V "^U and 
TW = W^ W, where the first factors are the base spaces and the second factors are the fibers (see (4.1)). 
The tangent map V '^^^0: U TW <E)wxu T*U (see Section 7) has a codomain that can be trivialized 
similarly; 

TW (g)wxuT*U ^{W^W) (g)wxu {V*^U) (W(g)V*) ^ {W x U) . 

Because {W (g) V*) ^ {W x U), as a set, is a direct product, it can be decomposed into two factors. Letting 
prj^ and pr2 be the projections onto the first and second factors respectively, 

prioV0: and pr^oV (j): U ^ W x U. 

The map prj oV (j)\s the element of F {W » U) identified with the base map (f> itself; pr}^^*^ o prj o V </) = 
This base map information is discarded in defining the differential of (f) as D(f) :— prj^ o V 0; the fiber portion 
of V (f). This construction relies critically on the natural isomorphism TW '^W ^W for a vector space W . 

An analogous construction shows that the differential Dcf) of a map (j) is well-defined even when its domain 
is a manifold. However, when the codomain of a map (j) is only a manifold, there does not in general exist a 
natural trivialization of its tangent bundle (in contrast to the vector space case), and therefore D(j) can't be 
defined without additional structure. A linear covariant derivative provides the missing structure. 

For the remainder of this section, let tt : _E — > define a smooth vector bundle having rank r. 

A linear covariant derivative on E provides a means of taking derivatives of sections of E (i.e. maps 
a: N ^ E such that t: o a — Id^) without passing to a higher tangent bundle as would happen under the 
tangent map functor (i.e. if cr e F (E) then Ta:TN^ TE and V ^^'^a: N -J> TE(g)ExN T*N). A Hnear 
covariant derivative provides an effective "trivialization" of TE analogous to the trivialization TW ^ W ^W 
as discussed above, discarding all but the "fiber" portion of the derivative, allowing the construction of an 
object known as the total linear covariant derivative analogous to the differential D(j) as discussed above. 

The notion of a linear covariant derivative on a vector bundle is arguably the crucial element of differential 
geometry^. In particular, this operator implements the product rule property common to anything that can 
be called a derivation - a property which is particularly conducive to the operation of tensor calculus. The 
total linear covariant derivative of a vector field (i.e. section of a vector bundle) allows the generalization 
of many constructions in elementary calculus to the setting of smooth vector bundles equipped with linear 
covariant derivatives. For example, the divergence divA :— tr DX of a vector field X on M" generalizes to 
the divergence div A := tr VA of a vector field A on A, which has an analogous divergence theorem among 
other qualitative similarities. 



Remark 8.1 (Natural linear covariant derivative on trivial line bundle). Before making the general definition 
for the linear covariant derivative, a natural linear covariant derivative will be introduced. With N denoting 
a smooth manifold as before, if / e C°° {N, M), then df eV {T*N) is the differential of /. Let 



N- 



7 := df. 



Because C°° {N, M.) is naturally identified with F (M » N) , this is essentially the natural linear covariant 
derivative on the trivial line bundle M x A^. Note that there is an associated product rule; if /, g E C°° {N, M), 
then /g G C°° (Af,K), and 

V (/g) = d ifg) ^gdf + fdg^gW + / V 

When clear from context, the superscript decoration can be omitted and the derivative denoted as V /. 



^The Fundamental Lemma of Riemannian Geometry establishes the existence of the Levi-Civita connection! > Pg- 
which is a linear covariant derivative satisfying certain naturality properties. 
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Definition 8.2 (Linear covariant derivative). A linear covariant derivative on a vector bundle defined 
by tt: i? ^ is an M-linear map V ^ : T (E) ^ T {E ®n T* N) satisfying the product rule 

V^(/®Ar(7) = a®Ar V^^"/ + /®Ar V^cr, (8.1) 

where / e C°° {N,M.) and a G T (E). The switch in order in the first term of the expression is necessary to 
form a tensor field of the correct type, T {E T*N). If cr S F (E), then the expression V ^cr is known 
as the total [linear] covariant derivative of a. If V = [in a subset U N], then cr is said to be 
parallel [on U\. The "linear" qualifier is implied in standard literature and is therefore often omitted. 

The inscribed i in V is to indicate that the covariant derivative is linear, and can be omitted when clear 
from context, or when it is unnecessary to distinguish it from the nonlinear tangent map operator whose 
decorated symbol is V . For the remainder of this section, this distinction will not be necessary, so an 
undecorated V will be used. 

For y e F (TN), it is customary to denote W^a-V by Vycr, where V indicates the "directional" component 
of the derivative. Following this convention, the product rule can be written in a form where the product 
rule is more obvious; 

(/ (g>N fT) - V^^«/ (g>NCT + f(g>N V^a. 

A covariant derivative is a local operator with respect to the base space N; if p £ N, then (V^cr) (p) 
depends only on the restriction of a to an arbitrarily small neighborhood of p [ , pg. 50[, and therefore 
the restriction V'^'^ : F[/ {E) — > F[/ {E T*N) makes sense, allowing calculations using local expressions. 
Furthermore, a covariant derivative can be constructed locally and glued together under certain conditions. 
See [■" , pg. 503[ for more on this, and as a reference for general theory on bundles, covariant derivatives, and 
connections. 

Linear covariant derivatives on several vector bundle constructions will now be developed. In analogy 
to defining a linear map by its action on a generating subset (e.g. a basis or a dense subspace) and then 
extending using the linear structure. Lemma (8.6) allows a covariant derivative to be defined on a generating 
subset (which can be chosen to make the defining expression particularly natural) and then extending. In 
this case, the relevant space is the space of sections of the vector bundle, which is a module over the ring 
of smooth functions on a manifold, and the extension process is done via linearity and the product rule 
(see (8.2)). This approach will allow the local trivialization implementation details to be hidden within the 
proof of Lemma (8.6) - an example of information hiding - so that constructions of covariant derivatives 
can proceed clearly by focusing only on the natural properties of the relevant objects and then invoking the 
lemma to do the "dirty" work (see (8.7) and (8.9)). 

A bit of useful notation will be introduced to simplify the next definition. If G C F is a subset of a 
C°° {N, M)-module F whose elements are functions on N (and therefore have a notion of restriction to a 
subset) and U C N is open, then let Gu denote the set of restrictions of the elements of G to the set U. 
Note that Gjj C F[/ by construction. 

Definition 8.3 (Finitely generating subset). Say that a subset of a module finitely generates the module 
if the subset contains a finite set of generators for the module. 

Definition 8.4 (Locally finitely generating subset). If F is a C°° (iV, M)-module and G C F, then G is said 
to be a locally finitely generating subset of F if each point q G N has a neighborhood U <^ N for which 

G(j finitely generates Tij. 

The space of sections of a vector bundle is the archetype for the above definition. The locally trivial 
nature of tt : E ~¥ N allows local frames to be chosen in a neighborhood of each point of N, from which global 
smooth sections (though not necessarily a global frame) can be made using a partition of unity subordinate 
to the trivializing neighborhoods. The set of such global sections forms a locally finite generating subset of 
TiE). 
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Lemma 8.5. If G is a locally finitely generating subset ofT(E), then each point in N has a neighborhood 
U Q N and ei, . . . , G Gu such that ei, . . . , forms a frame for Tjj (E). In other words, a local frame can 
be chosen out of G near each point in N. 

Proof. Let q E N and let C be a neighborhood of q for which Gy — {.9i, • ■ • , gt} finitely generates 
Ty [E) (here, i > r, recalling that r = lankE'). Without loss of generality, let gi (q) , . . . ,gr (q) be lin- 
early independent (this is possible because {gi (q) , . . . ,ge (q)} spans the vector space Eq). Because gi is 
continuous for each i and the linear independence of the sections gi, gr is an open condition (defined by 
L^^ (M\ {0}) where L: N ^ /\^ Eq, p t-^ gi (p) A ■ ■ ■ A gr (p)), there is a neighborhood C/ C F of g for which 
{gi (p) , . . . , gr (p)} is a linearly independent set for each p £ U . Finally, letting e.; gi \u for « G {1, . . . , r}, 
the sections ei, . . . , e^. € Gjj form a frame for Vu [E). □ 

The following lemma shows that defining a covariant derivative on a locally finitely generating subset of 
the space of sections of a vector bundle is sufficient to uniquely define a covariant derivative on the whole 
space. The particular generating subset can be chosen so the covariant derivative has a particularly natural 
expression within that subset. 

Lemma 8.6 (Linear covariant derivative construction). Let G be a locally finite generating subset ofT (E). 
If : G — > r (E (E)N T* N) satisfies the linear covariant derivative axioms^, then there is a unique linear 
covariant derivative : F (E) T {E (E)n T* N) whose restriction to G is . 

Proof. If q G N, then by (8.5) there exists a neighborhood U C N of q for which there are ei, . . . , G Gjj 
forming a frame for E If a S F(i?), then a \u= cr^ei for some a^,...,a^ S C°° iU,M.) (specifically, 
cr' = e' (7 |[/, where e^, . . . , e'' € F^ (E*) denotes the dual coframe of ei, . . . , e,.). Define : F (E) — > 
F (E (8)jv T*N) locally on Tjj (E) so as to satisfy the product rule 

(cr 1,7) := e, (g)N V^^"cr' + a' (g>N V^e^. 

To show well-definedness, let /i, . . . , € Gu be another frame for E Then cr = r'/^ for some r^, . . . , r*" G 
G°° {U, E). Let \I/ : Tu (E) Tij (E) be the unique smooth vector bundle isomorphism such that fi = VP-^ei. 
Writing 4' and ^!~^ with respect to the frame (e^) as ^'^ e^ ig) and ("il^^y. Ci <Si respectively, it follows 

that /, = ^e-j and t' = Then 

- ^le, ®N V^-^« (a^ (*-!);) + {^-^^ ®^ V« (M^f e^) 

The last equahty follows because ^I'J'' (^'^^)^ = S^, which is a constant function, so V^~** (^^i (^P^^)^^ — 0. 

Thus the expression defining doesn't depend on the choice of local frame. This establishes the well- 
definedness of V'^. 

Clearly the restriction of to G is V*^. This establishes the claim of existence. Uniqueness follows 
from the fact that is defined in terms of the maps v^^'^ and V*^. □ 

Lemma (8.6) is used in the proof of the following proposition to allow a natural formulation of the 
puUback covariant derivative with respect to a natural locally finite generating subset of F((/)*£'), in which 
the relevant derivative has a natural chain rule. 

*What is meant by this is that the product rule must only be satisfied on A®jv 3 if £ G, where A £ C°° (A'^, R) and g £ G. 
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Proposition 8.7 (Pullback covariant derivative). If (p: M ^ N is smooth and is a covariant derivative 
on E, then there is a unique covariant derivative ^ on (p* E satisfying the chain rule 

V^'^r e = 0* V^e -^.TN V 

for all eer{E). 

Proof. Let G := {a E T {(j>*E) \ a = (f>*e for some e e F (E)}, noting that a local frame ei, . . . , Bj-ankB G 
T(j (E) over open set U C N induces a local frame (f>*ei, . . . , '/'*ei.ank£; € ^<ii-^(u) so G is a locally 

finite generating subset of F {(f>*E). Define 

V^:G ^ T{(i3*E®MT*N), 

0*e ^ <?!)*V^e-0.TAr V^^-'^^. 

The well-definedness and M-linearity of V comes from that of V^. For the product rule, if A G C°° (Af,M) 
and e € F [E), then the product \ ®m </'*e is an element of G if and only \i X — (p* ^ for some fi E G°° {N, M), 
in which case, A (E)m <P*e 

V'^(A(8)M <P*e) = 



which is exactly the required product rule. By (8.6), there exists a unique covariant derivative ^ on (f>* E 
whose restriction to G is V'-^. □ 

The full notation ^ is often cumbersome, so it may be denoted by when the pulled-back bundle 
is clear from context. 

Remark 8.8. There is an important feature of a pullback covariant derivative in the case that pullback map is 
not an immersion; the pullback covariant derivative may be nonzero even where the pullback map is singular. 
This fact can be obscured by a certain abuse of notation which often comes in the expression of the geodesic 
equations in differential geometry (see (12.8)). An example will illustrate this point. 

Let V^*^ be a covariant derivative on : TM — ^ M. Let 6 : M ^ T1\I be a unit-length vector field 
which describes the location of a person (the basepoint) and direction s/he is looking (the fiber portion) 
with respect to time (let M have standard coordinate t). Define 0: M — M by 6' := ttJ/^ o 8, so that 6 is 
the base map of Q, i.e. 9 has discarded the direction information and only encodes the location information. 
Say that for some closed interval ICR, ^ is identically zero (and so is not an immersion), but that 
^ 1/ is nonvanishing; see Figure 8.1. Mathematically, this means that during this time, Q is varying only 
within a single fiber of TM. Physically, this means that during this time, the person is standing still but 
the direction s/he is looking is changing. Passing to a higher tangent space is often undesirable (note that 
^ takes values in TTM), so to avoid this, a covariant derivative is used. In order to be meaningful, the 
covariant derivative must capture this fiber-only variation. 

Because 8 is a vector field along 9, it can be written as 8 G F {9*TM), and the covariant derivative on 
TM induces a pullback covariant derivative on 9*TM, which has base space M. In other words, 9*TM is 
parameterized by time. Then V^-^^^8 E r{9*TM) is the desired covariant derivative of 8 with respect to 

di 

time. A coordinate-based calculation will be made to make completely obvious why this pullback covariant 
derivative captures the desired information. Let (2;*) be local coordinates on M and, for simplicity, assume 
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— (XiAf (j)*e — (j)* (^ (g)jv e). Then it follows that 

: V^0* (M«)Are) 
:</)*V^(M(»ive)-^.TJV 
(t>* (e (g>N V^^V + M <»JV V^e) -^^tn V 

= cp*e ®M V^^«M -^.TAT V ^^^^(/.) + 0> ®M V^e -^^tn V ^^^^0) 

= <t)*e ®M V*^^"A + A ®M V^0*e, 




Figure 8.1: A picture of the manifold M , path 9, and vector fields 0' and <d. The blue dots represent 6 (t) at 
certain points t e M. while the green and red arrows represent 9' (t) and O (t) at at these points respectively. 
Note that is a unit-length vector field along 9 and varies within /, whereas 6' is a vector field along 9 that 
vanishes within /. 



that the image of 6 lies entirely within this coordinate chart. Because (9,) is a local frame for TM, {9*di) 
is a local frame for 9*TAL by (6.4) and 9 G T {9*TM) can be written locally as 6 [t) = 6^ {t) {9*8-) (t) for 
some functions (0* : M ^ M) . Then 



dt dt 

\ dt / dt 



at at 



^9*d. + &9*V™d.-e.TM^. 
at at 



Note that V ^^^^9 G T{9*TM). Within the interval /, ^ vanishes, so the second term vanishes on /. 

However, because O is varying in a fiber-only direction within /, the basepoint is not changing and ^^9*di 
can be identified with an elementary vector space derivative (the fiber is a vector space and so an elementary 
derivative is well-defined there). This fiber-direction derivative is nonvanishing by assumption, so V^'^-^^^O 

dt 

is nonvanishing on / as desired. 

Introducing a bit of natural notation which will be helpful for the next result, if X E T (E) and Y E T (F), 
then define X (SY = X ©mxjv Y eT {E®mxn F) and X (^Y = X (Smxn Y er{E (^mxn F) by 

{X®MxNY){p,q):^X{p)®Y{q) and {X <»mxN Y) {p,q) -.^ X {p) (8)Y (q) 

for each (p, q) £ M x N. 
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Proposition 8.9 (Induced covariant derivatives on E(BmxnF and E^mxnF). IfV^ and are covariant 
derivatives on E and F respectively, then there are unique covariant derivatives 

VB®MX«F . p ^^^^^ ^ p ((^ ^^^^^ ^^^^^ (J.*^^ ^^^^^ J.*^)) 

and 

VB®Mx«F . p ^^^^^ F) ^ r {{E (g>MxN F) (g,MxN {T*M ®MXN T* N)) 

on E (B F and E ^S) F respectively, satisfying the sum rule 
and the product rule 

respectively, where X e T {E), Y e T (F), and u (B v e TM ® TN. Here, TM TN M x N (and its 
dual) is used instead of the isomorphic vector bundle T {M x A^) M x N (and its dual). 

Proof. Suppressing the pedantic use of the M x N subscript to avoid unnecessary notational overload, the 
set G {e e / I e e r (£;),/ e r {F)) is a locally finite generator oiT{E® F), since local frames for E®F 
take the form {et ® 0, /j}, where {ci} and {/j} are local frames for E and F respectively. Define 

V^:G ^ T{{E®F)®{T*M®T*N)), 

X®Y ^ {u®v^ Vf X Vf Y) , where u®ve TM ® TN. 

This map is well-defined and M-linear by construction, since the connections and are well-defined 
and M-hnear. If X e C°° {M x N,R), X e T (E), and F e T (F), then the product A (g) (X © F) is in G (i.e. 
has the form X ® F for some X G T [E) and F G F (F)) if and only if A is constant. Thus the product rule 
(restricted to elements of G) reduces to M-linearity, which is already satisfied. By (8.6), there exists a unique 
connection V'^®''^ on E <S) F whose restriction to G is V*^. 

Similarly, the set := {e €5 / | e G F {E) , f £T (F)} is a locally finite generator of F (F (g) F), since local 
frames for E ^ F take the form {e^ €5 fj}, where {e^} and {fj} are local frames for F and F respectively. 
Define 

V":H T {{E (g) F) ^ (T* M ® T* N)) , 

X(g>Y i-^ (uewM- Vf X (g)Y + X (g)V^Y) , where u®v e TM © TN. 

This map is well-defined and M-linear by construction, since the connections and are well-defined and 
M-linear. For the product rule, with A € G°° (M x iV, M), X e F (F), and F e F (F), the product A(g)(X (g Y) 
is in H if and only if there exist ^ e G°° (M, M) and ly e C°° {N, R) such that X = e (M ^ Af ) (g) (M ^ iV) 
(noting that then A (gMxAf {X (^Y) = (^ (g z^) (gA/xw (^ (g F) = {f^(^M X) g) (z^ (gjv y))- In this case, with 
u®v€TMG) TN, 

Vfe. {X<S)m>cn{X<»Y)) 

= ^u(Bv ((M ® v) (E)MxN {X (g F)) 

= Vf (At g)M ^) (g (i^ (gw Y) + {fi<g)M X) g) Vf (z^ (gw y) 
= (Vf ^"^Ai <»M X) (g (;/ (gjv F) + (Ai (gM X) g) (z/ g)Ar F) 

+ (/i(gAfX)(g (V^^~"^i/g)wy) + (Ai<gM^)«) (z^fgAT VfF) 
= (Vf ^ V ® + M «) ^"^z^) ^AfxAT (X«)F) A^Afxiv (Vf X g) F + X (g F) 

= V^^^-^^A (g>MxN (X ® F) + A (gAf xiV Vfg3„ (X (g F) , 

which is exactly the required product rule. By (8.6), there exists a unique connection V^**^ on F(gF whose 
restriction to iJ is V^. □ 
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Remark 8.10 (Naturality of the covariant derivatives on E®mxnF and EiS)mxn F). Letting pr^ := pr^^''^ 
(i S {1, 2}) for brevity, the maps 

^■.E(BmxnF ^ prj £:®Mx7vpr2-F, 

e®/ ^ ((7r^®7r^)(e©/),e)©Mxiv(K®7r^)(e®/),/) 

and 

ip:Ei^MxNF prii;(g)MxJvpr2-F, 

6 0/ ^ ((7r^(8)7r^)(e(g)/),e)(^MxAr((7r^®7r-^)(e «)/),/), 

each extended linearly to the rest of their domains, are easily shown to be smooth vector bundle isomorphisms 
over Umxn- Then 

and 

vf (X ^ r) = (vr* ^^"x^p'-^ (X ® y)) 

for all X G r (£;), y G r (i^), and z eT {M x N), showing that the connections on E ® F and E ® F a.ie 
and V'-related to the naturally induced connections on pr* E®pr2 F and pr* E0mxnP^2 F respectively, and 
are therefore in this sense natural. The sum X (BY E T {E (B F) and product X (E)Y E T {E (E) F) correspond 
to pr^ X (Bmxn W2 ^ P'^i ^ ®MxJV W2 ^ G r (pr^ E (^mxn pr2 F) under ^mathn?/; respectively. 

Many important tensor constructions involve permutations. An extremely useful property of these per- 
mutations is that they commute with the covariant derivatives induced by the covariant derivatives on the 
tensor bundle factors, making them natural operators in the setting of covariant tensor calculus. 

Proposition 8.11 (Transposition tensor fields are parallel). Let Ei,E2,Ez,Ei he smooth vector bundles 

over M having covariant derivatives V"^^ , V^^ , V^^ , V"^*' respectively, let A Ei ®m E2 (E)m <S)m and 
B := El ®M E3 ®M E2 ®M E4, and let V'^ and denote the induced covariant derivatives. 

If (2 3) G r {A* 0M B) denotes the tensor field which maps ei 0m ^^2 ®m 63 ®m G A to e\ ®m 63 0m 
62 0M 64 G B (i.e. (2 3) transposes the second and third factors), then (2 3) is a parallel tensor field with 
respect to the covariant derivative induced on the vector bundle A* 0m B — ^ M, i.e. V'^ ®mB (2 3) = 0. 

Proof. Let X G r [TM). Then 

(ei 0M 62 0M 63 0M 64) -A' Vf (2 3) 

= ((ei 0M 62 0M 63 0M 64) -A' (2 3)) - Vx (ei 0m 62 0m 63 0m 64) -a* (2 3) 
= Vf (ei 0M 63 0m 63 0M 64) 

- V;^'ei 0M 63 0M 62 0M 64 

- ei 0M Vf^es 0M 62 0M 64 

- 61 0M 63 0M V5^62 0M 64 

- 61 0M 63 0M 62 0M Vf*e4 

= Vf (ei 0M 63 0M 63 0M 64) - Vf (61 0M 63 0M 63 0M 64) 

= 0. 

Because X is arbitrary, this shows that (ei 0m 62 0m 63 0m 64) -a* V"^ ®mB ^23) = 0. This extends linearly 
to general tensors, so V'^ ®mB {2i) — 0, as desired. □ 

The fact that all transposition tensor fields are parallel implies that all permutation tensor fields are 
parallel, since every permutation is just the product of transpositions. This gives as an easy corollary that a 
covariant derivative operation commutes with a permutation operation, which has quite a succinct statement 
using the permutation superscript notation. 
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Corollary 8.12 (Permutation tensor fields are parallel). Let Ei, .. .,Ek be smooth vector bundles over M 
each having a covariant derivative, and let A := Ei®m' ' ■®MEk and B :— i?o--i(i)*8'M' • •'8)M-E'cr-i(fc)- G 
Sk is interpreted as the tensor field in T (A* (i)M B) which maps ei®M' ' ■®MGk to e.„-i(i)®M ■ • •<8>Meo--i(fc), 
then a is a parallel tensor field. Stated using the superscript notation, with X (TM) and a G F (A), 

Proof. This follows from the fact that a can be written as the product of transpositions; ~ because 

of the product rule and because each transposition is parallel. The claim regarding commutation with the 
superscript permutation follows easily from its definition. 

Vf a- = Vf (a -A' o) = a vf + V^a -a* o = (V^a)" , 

using the fact that ®mB ^ _ ^{yicg ct is a parallel tensor field. □ 

9 Decomposition of 7r|J^ : TE — > E 

In using the calculus of variations on a manifold M where the Lagrangian is a function of TM (this form 
of Lagrangian is ubiquitous in mechanics), taking the first variation involves passing to TTM. Without a 
way to decompose variations into more tractable components, the standard integration-by-parts trick [ , pg. 
16] can't be applied. The notion of a local trivialization of TTM via choice of coordinates on M is one way 
to provide such a decomposition. A coordinate chart {U,(j): U — M") on M establishes a locally trivializing 
diffeomorphism TTU = 4>{U) x M" x M" x M". However such a trivialization imposes an artificial additive 
structure on TTU depending on the [non-canonical] choice of coordinates, only gives a local formulation 
of the relevant objects, and the ensuing coordinate calculations don't give clear insight into the geometric 
structure of the problem. The notion of the linear connection remedies this. 

A linear connection on the vector bundle tt : — > A/ is a subbundle H ^ E oi tt^^ : TE — > E such 
that TE = H ®E VE and TAq • = Hax for all a e E\ {0} and x e E, where Xa'. E ^ E, e i-^ ae is the 
scalar multiplication action of a on i? ] , pg. 512]. The bundle H ^ E may also be called a horizontal 
space of the vector bundle tt^^ : TE E ("a" is used instead of "the" because a choice oi H E\& generally 
non-unique). For convenience, define h :— V tt G F {tt*TM (E)e T* E) , noting then that VE = ker h. 

A linear connection can equivalently be specified by what is known as a connection map; essentially a 
projection onto the vertical bundle. This is a slightly more active formulation than just the specification of 
a horizontal space, as a covariant derivative can be defined directly in terms of the connection map - see [m, 
pg. 518], ]-:, pg. 128], I , pg. 173], and [12, pg. 208]. 

Proposition 9.1 (Connection map formulation of a linear connection). If v £ T {tt* E (E)e T* E) (i.e. 
v. TE —^Eisa smooth vector bundle morphism over n) is a left-inverse for Ly^ € T {VE t^* E*) 
that is equivariant with respect to TXa and Aq (i.e. v ■ TXa — n* Xa ■ v) [V2, pg. 245], then H := keri; < TE 
defines a linear connection on the vector bundle t: : E ^ M. Such a map v is called the connection map 
associated to H . Conversely, given a linear connection H, there is exactly one connection map defining H 
in the stated sense. 

Proof. That w is a left-inverse for Lyj^ implies that v has full rank, so H := kerw defines a subbundle of 
ttIJ-^ : TE — > E having the same rank as TM. Because v is smooth, _ff is a smooth subbundle. Furthermore, 
the condition implies that VeEDHe = {0} for each e £ E, and therefore TE ~ H ®e VE by a rank-counting 
argument. 

If a; e TE and a G M\ {0}, then v ■ TXa ■ x = n* Xa ■ v ■ x, which equals zero if and only if u • a; = 0, i.e. if 
and only ii x € H. Thus TXa ■ H = H . This establishes H E as a, linear connection. 

Conversely, if iif is a linear connection and vi and V2 are connection maps for H , then vi -te i^ve ~ 
Id^ 'E — V2 -TE I'VE - Then because the image of is all of VE, it follows that vi \ve— V2 \ve- Since 
vi \h= = V2 |h by definition, and since TE — H (Be VE, this shows that vi = V2. Uniqueness of 
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Figure 9.1: A diagram representing the decomposition of TE — > E into horizontal and vertical subbundles. 
The vertical lines represent individual fibers of E, while p G M, Cp (z Ep, Op d Ep denotes the zero vector 
of Ep, and ZE denotes the zero subbundle of E; ZE = M. By the equivariance property of the linear 
connection, ZE is a submanifold of E which is entirely horizontal (its tangent space is entirely composed of 
horizontal vectors). The tangent spaces Tq^E and T^^E are drawn; green arrows representing the vertical 
subspaces ("along" the fibers), red arrows representing the horizontal subspaces. Finally, c is a horizontal 
curve passing through Cp. 

connection maps has been established. To show existence, define v :— lX'^e 'VE P^ve ^ T (ir* E (E}e T* E), 
where pr^/^ : H(BeVE — )■ VE be the canonical projection, recalling that H(BeVE = TE. It is easily shown 
that w is a connection map for H. □ 

Proposition 9.2 (Decomposing tt^^ : TE — > E). If v {n*E i^e T*E) is a connection map, then 

h®Ev:TE ^ Ti*TM®ETr*E (9.1) 
is a smooth vector bundle isomorphism over Id^. 5*66 Figure 9.1. 

Proof. Because TE — H ®e VE, and H = kcrv and VE = ker/i, the fiber-wise restriction 

h®Ev\T,E-TeE {T:*TM®ETr*E)^^T^^^)M®E^^^^ 

is a linear isomorphism for each e G E. The map is a smooth vector bundle morphism over Id^ by construc- 
tion. It is therefore a smooth vector bundle isomorphism over Id^. □ 
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Remark 9.3 (Linear connection/covariant derivative correspondence). Given a covariant derivative on a 
smooth vector bundle tt : _E M, there is a naturally induced linear connection, defined via the connection 
map 

v.TE E, (9.2) 

where 0: / — >■ £^ is a variation of 9 £ E. Here, V'-'^"®-' ^ denotes the puUback of the covariant derivative 
through the map tt o (see (8.7)). Conceptually, all v does is replace an ordinary derivative (5,) with 
the corresponding covariant one 

Conversely, given a connection map v G T {tt* E (E) e T* E) for a linear connection H ^ E, there is a 
naturally induced covariant derivative on the smooth vector bundle tt: E M, defined by 

v-^:r(£;) ^ r(E^MT*M), 

The scaling equivariance of v is critical for showing that this map actually defines a covariant derivative. 
Full type safety should be observed here; by the contravariance of the puUback of bundles (see (6.3)), 
a*7r*E ^{-Koa)* E = Id^j E'^ E,&o 

<j*v e r(CT* {tt*E®eT*E)) ^T{c7*'K*E(g)M a*T*E) '^T{E®m a*T*E) , 

and therefore a*v ■ V cr G T{E®mT*M) as desired. This connection map construction of a covariant 
derivative gives (8.7) as an immediate consequence via the chain rule for the tangent map. 

The following construction is an abstraction of taking partial derivatives of a function, inspired by [ I i , pg. 
277]. Instead of taking partial derivatives with respect to individual coordinates, partial covariant derivatives 
along distributions over the base manifold are formed, where the distributions (subbundles) decompose the 
base manifold's tangent bundle into a direct sum. Such a construction conveniently captures the geometry 
of maps with respect to the geometry of its domain. 

Proposition 9.4 (Partial covariant derivatives). Let L E C°° (M, M), and for each i E {I, . . . , n} let Fi ^ M 
be a smooth vector bundle. If, for each i G {1, . . . , n}, Ci G F (Fi iSim T* M) such that ci (Bm ■ ■ ■ ®m c„ G 
r {{Fi ®M • • • (Bm Fn) ®M T* M) is a smooth vector bundle isomorphism over Id^, then there exist unique 
sections G F (F*) for each i G {1, . . . ,n} such that 

yM-+R^ = ixi -Fi Ci H h L^c„ ■F„ C„. 

This decomposition ofVL provides what will be called partial covariant derivatives of L (with respect to 
the given decomposition) . 

Proof. The following equivalences provide a formula for directly defining L , . . ■ , L,Cn- 

L,ci -Fi Ci + • • • + L^c„ -Fn Cn 

{L^ci ®M ■ ■ ■ ®M L^c„) ■Fi(BM--®MFn. (Cl ©M ' ' ' ©Af C„) 
i,ci ©Af • • • ©A'/ L,c„ ■ 

Existence and uniqueness is therefore proven. □ 

Corollary 9.5 (Horizontal/ vertical derivatives). Let h := V tt G T {'k*TM ®eT* E) as before. If v £ 
F (tt* E ®eT*E) is a connection map, and if L: i? — > M is smooth, then there exist unique L ^ G F (7r*r*M) 
and L^y G F {Tr*E*) such that VL = L^h -tt'tm h + L ^ e v. 



VL 

VL 

-1 



TM 



(ci (Bm ■ ■ ■ ©M Cn) = 
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It should be noted that the basepoint-preserving issue discussed in Section 7 plays a role in choosing to 
use the tensor field formulation of h: TE TM and v: TE E. In particular, without preserving the 
basepoint (via the 7r-pullback of TM and E to form heV {■k*TM (S)e T*E) and v G T {Tr*Ei»E T*E)), the 
map h v would not be a smooth bundle isomorphism, and the horizontal and vertical derivatives would 
be maps of the form L^h ■ E — )■ T*M and L ^ : E ^ E* , but that, critically, are not sections of smooth vector 
bundles, and can only claim to be smooth [fiber] bundle morphisms. Derivative trivializations will be central 
in calculating the first and second variations of an energy functional having Lagrangian L (see (12.1) and 
(13.1)). 

10 Curvature and Commutation of Derivatives 

A ubiquitous consideration in mathematics is to determine when two operations commute. In the setting of 
tensor calculus, this often manifests itself in determining the commutativity (or lack thereof) of two covariant 
derivatives. Here, "covariant derivatives" may refer to both linear covariant derivatives and the tangent map 
operator (see (7.1)). This unified categorization of derivatives will now be leveraged to show that certain 
fiber bundles are fiat (in a sense analogous to the vanishing of a curvature endomorphism) with respect to 
particular covariant derivatives. This reduces the work often done showing commutativity of derivatives in 
the derivation of the first variation of a function in the calculus of variations to the simple statement that a 
particular tensor field is symmetric, which is comes as a corollary to the aforementioned fiatness. 

In this section, the symbol V may denote V or V , depending on context. This eases the expression of 
repeated covariant derivatives, such as the covariant Hessian of a section (see below), and is an example of 
telescoping notation as discussed in Section 3. 

If tt: i5 — >■ A/ defines a smooth [fiber] bundle whose space of sections F [E) has two repeated covariant 
derivatives defined and if V"^*^ is a symmetric linear covariant derivative (meaning V xY — VyX — [X, Y] 
for X,Y E r (TM)), then the tensor contraction 

V^a : {X (E)mY -Y (E)M X) 

is an expression measuring the non-commutativity of the X and Y derivatives of a. The quantity V^tr will 
be called the covariant Hessian of a, because it generalizes the Hessian of elementary calculus; it contains 
only second-derivative information, and in the special case seen below, it is symmetric in the argument 
components. It should be noted that if F — > M is the vector bundle such that Vu E T {F ®m T*M), then 
V^cr G r {F (E)M T* M T* M). Intentionally leaving the V and • symbols undecorated in preference of 
contextual interpretation, unwinding the expression above gives 

V^cr ; {X ®M Y ^Y (g)M X) = Vy^o- ■ X - Vct • Y 

= VyVxCr - Vcr • VyX ~ Vx^YCr + Vd • V xY 
= -VxVyCr + Vy VxCT + Vcr • [X, Y] 
= -VxVyCr + Vy VxO- + '^[x,Y]^, 

which is syntactically identical to the common definition for the [Riemannian[ curvature endomorphism 
R {X, Y) a. In the traditional setting, where is a linear covariant derivative on vector bundle E, the 
curvature endomorphism takes the form of a tensor field e T [E ®m E* ®m T* M ®m T*M). In this 
setting however, because may be nonlinear, such a tensorial formulation doesn't generally exist. Instead, 

i?^ {X, Y) := -VxVf + Vy Vf + V^^y] 

defines a second-order covariant differential operator ("covariant" meaning tensorial in the X and Y compo- 
nents). Put differently, 

R^ {X, Y)a^V'^(j:{X®MY -Y ®m X) , 
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which will be called the (possibly nonlinear) curvature operator, measures the non-commutativity of the 
X and Y derivatives of a. If is identically zero, then the bundle E is said to be flat with respect to the 
relevant connections/covariant derivatives. 

There are two particularly important instances of flat bundles. The first is the trivial line bundle defined 
by 1^^^^ (whose space of smooth sections, as discussed in Section 4, is naturally identified with C°° (S", M)). 
In this case, V^^^/ e V {T*S), and V^/ = V ^'■^ V ^^^f € V {T* S ®s T* S) is the object referred to in 
most literature as the covariant Hessian of /. Here, R^~^^ (X, F) / is a real-valued function on S. 

Proposition 10.1 (Symmetry of covariant Hessian on functions). Let S be a smooth manifold and let V^'^ 
be a symmetric covariant derivative. If f G C°° {S,M.), then V^/ € T {T* S (E}s T* S) is a symmetric tensor 
field (i.e. it has a (12) symmetry). Here, the covariant derivative on is V'^"*''^ as defined above. 

Proof Let X,Y eT (TS). Recall that Vf = df eT {T*S). Then 

: {X (E)sY -Y (E)s X) 
= VYVf -X-VxVf -Y 

= Vy (V/ ■X)-Vf- VyX - Vx (V/ • r) + V/ • VxY 

= V {Vf ■ X) - Y -V (Vf - Y) ■ X + Vf -[X, Y] (by symmetry of V"^^) 

= - V/ • [X, Y]+Vf ■ [X, Y] (by definition of [X, Y]) 

= 0. 

Because X(^sY is pointwise-arbitrary in TS ^sTS, this shows that V^/ is symmetric. Equivalently stated, 
jjS-i-R jg identically zero, and therefore the relevant bundle is flat. □ 

The second important case involves the nonlinear covariant derivative v*^^'^ on {M,S). Here, if 
e C°° (M, S), then 

so {X, Y)(f)£T {(l)*TS). 

Proposition 10.2 (Symmetry of covariant Hessian on maps). Let M and S be smooth manifolds and let V^^^ 
and V"^^ be symmetric covariant derivatives. If <j) & C°° (A/, S), then V^cf) G T {(t)*TS (8)m T*M®m T* M) 
is a tensor field which is symmetric in the two T* M components (i.e. it has a (2 3) symmetry). Here, the 
covariant derivative on C°° {M, S) is V^^^'^ as defined above. 

Proof. Let X, y e F {TM) and / e C°° (5, M), so that 0* V/ G V {<j)*TS). Then 

= - Vx (</'*v/ ■vcj)-Y) + Vxr V/ • V0 • y 

+ Vf {(t)*Vf -Vc/j-X)- VY(l)*yf ■V<j)-X 

+ <^*v/-V0-[x,r] 

= - Vx (V0*/ • y) + Vy (V0*/ ■X) + Vcj>*f- [X, Y] 

+ ((/)*vV • V(/i • X) • V(/i • y - ((/)*vV • V0 • y) • V(/) • X 
= - V (vr / • y) • X + V (V0*/ • X) • y + V(/.*/ • [x, y] 
- 0*v2/ : ((V0 • X) ®M (V(/> • y) - (V0 • y) (v^ • x)) . 

By definition, -V (V(/)*/ -Y)- X + V {V(i)*f ■ X) - Y ^ -^4>*f ■ [X, Y], which cancels out the other term. 
By (10.1), V^/ is symmetric, so the final term is zero. Because 4>*'Vf is pointwise-arbitrary in (j)*T*S and 
X and y are pointwise-arbitrary in TM, this shows that R^^~^^ is identically zero, so the bundle defined 
by TT^/*^ : S X M ^ M, whose space of sections is identified with C°° (M, S), is flat, and therefore V^(f) is 
symmetric in its two T* AI components. □ 
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The construction used in (9.4) can be applied to nonlinear as well as linear covariant derivatives to 
considerable advantage. For example, if ip: M x N ^ L, where M,N,L are smooth manifolds and pM ■= 
prf ^ and PAT := prf ^, then define m e T {i;*TL (giMxN pIjT* M) and ?/' w e T {^I:*TL (giMxN P*nT* N) 
by 

This gives a convenient way to express partial covariant derivatives, which will be used heavily in Part II in 
calculating the first and second variations of an energy functional. Note that in this parlance, 'ip^(MxN) is 
the full tangent map V ip. 

Defining second partial covariant derivatives ip^MMi '>P,mNj '^,nm S'lid tJj.nn by 

VV',M = 'ip.MM ■ ^ PM + i'MN ■ ^ Pn and 

Vlp^N = 'Ip.NM ■ V PM + i^MN ■ ^ PN, 

the symmetry of the covariant Hessian of can be used to show various symmetries these second derivatives. 

Proposition 10.3 (Symmetries of partial covariant derivatives). With ^ and its second partial covariant 
derivatives as above, 

^.MM &r{^*TL(g,MxN P*mT*M ®MxNP*mT*M) 

and (having analogous type) are {2 3) -symmetric (i.e. {4',mm MM and {iP.nn) = '4',nn) 

and the mixed, second partial covariant derivatives 

i^MN e r {i;*TL ®MxN P*mT*N ^mxn P*nT*N) and 
i^,NM eT{iP*TL®MxNP*NT*N ®mxnpIiT*M) 

are mutually {2 3) -symmetric (i.e. 4',mn — (V'-./vm) )■ 
Proof. Let X,Y eT {TM TN). If TpN • X = and TpM -Y ^0, then 
= V^V- : {X ^MxN Y-Y ^MxN X) (by (10.2)) 

= V'.MM : ( VpM • X (g>MxN ^ PM Y - VpM -Y (g)MxN ^ PM ' X) 
+ ■0,MAr -.{"^PM-X ®MxN PN 'Y - V PM 'Y ®MxN Pn ■ X) 
+ ^.NM -.{^Pn-X (»mxN VpM-Y -VpN-Y ®MxN ^ PM ' X) 
+ ^.NN ■■ C^PN ■ X (g)MxN Vpjv - y- '7pN-Y(g)MxN PN ■ X) 

= i^MN ■ {V PM ■ X ®MxN V PN - Y - V PN -Y ®MxN V PM ' X) 

= {l^^,MN - ('/'.ATm)*^^^) : {V PM ■ X <g)MxN V PN ■ Y) . 

Because V pm ■ X and V pj\j ■ Y are pointwise-arbitrary in p\,jTM and p*^TN respectively, this implies that 
ip,MN = {iP,nm) ■ Analogous calculations (setting V pm ■ X — Q and V pm • ^ = and then separately 
setting VpTv • ^ = and V pn ■Y = 0) show that ^^mm — i'<P,MMY'^^^ and ^_nn ~ ('0,Af7v)*'^^''- □ 

There are two final results regarding the second covariant derivative that will be especially useful in the 
calculation of the first and second variations of an energy functional (see (12.1) and (13.2)). 

Proposition 10.4 (Chain rule for covariant Hessian). Let n: E ^ N define a bundle having a first and 
second covariant derivative (i.e. a section of E can be covariantly differentiated twice). If (j): M ^ N and 
e e r(£:), then 

V'^(j)*e = cj)*V'^e -.^^TN (V^^Af V 0) + Ve -^.tat VV 
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Proof. Let X € r (TAf). Then 



Vx (0* Ve) -^^TN V + 0* Ve -^.tw Vx V 

(0* V^e -^.TW V • X) -^.TAT V + (/i* Ve -^.tat Vx V 

[(/.*V'e :^.TJV (V0Hm V 0) + 0* Ve -^.t^v VV 0] -X 



Because X is pointwise-arbitrary in TM, this estabhshes the desired equahty. 



□ 



Proposition 10.5 (Fullback curvature endomorphism) . Let tt: E ^ N define a vector bundle having first 
and second covariant derivatives. If (j>: M N, then R'^ = (j>*R'^^ -(p'TN (^0^M ^0)- 



Proof Note that i?"^'™ e T {(l)*TN ®m <t>*T* N ®m T* M ®m T* M). Let X,y e F (TAf ) and let Z e 
F {TN), so that 0*Z e F {(j)*TN). Then 



and because X, K and (p* Z are pointwise-arbitrary in their respective spaces, this establishes the desired 



A common operation is to evaluate a covariant derivative along a single tangent vector. One can express a 
single tangent vector as a section of a particular pullback bundle, the map being the constant map evaluating 
to the basepoint of the vector. This allows the richly-typed formalism of pullback bundles to be used to 
evaluate derivatives at a point, particularly noting that this safely deals with the overloading of the natural 
pairing operator • (see Section 5). 

Proposition 10.6 (Evaluation commutes with non- involved derivatives). Let A and B be smooth manifolds 
and let a r (E) for some smooth bundle E ^ A x B having a covariant derivative V^. If b E B and the 
map z : A ^ A X B , a 1-^ (a, b) represents evaluation at b, then 




\TM {X ®M y) 



\/^q^*Z:TM {XAmY) 

rV^Z -.^.TN ( V Mm V ^) :tm {X Am Y) 

+ (j}*^Z -^-TN^Vcl) :tm {X Am Y) 

0* V^Z i^.TW (( V • X) Am ( V • y)) + cb*VZ -^.tn 

R^^)) -rTN ((V0-X) ®m{^4>-y)) 

(ld^*TN<»M(l>*Z) -^'TN^MrT'N <I>*R^^ -rTN ( V Km V <?!>) iTAf {X ®mY), 




equality. 



□ 




Proof. Let X e F {TA), and let := prf and pb 



.AxB 



. Then 




and because X is pointwise-arbitrary in TAf, this implies that z*(7_a = a desired. 



□ 
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Proposition 10.7. Let A,B,C be smooth manifolds, let ip : A x B ^ C be smooth, let pa '■= wt^^ o.'i^d 
Pb :=pr^'''^, and let X,Y €T{TA®TB). If V Pb ■ X = and '^pA-Y = 0, then 

tAB : ((VPA • X) ^AxB ( VPB • Y)) = Vf ^^V^^^^^^. 

Proof. The conditions V ps • X = and ^ pa ■ Y = imply that VyX = in the product covariant 
derivative. Then since pa XaxbPb = Id^xs, it follows that 

VVpA ®AxB VVpB = V(VpA ®AxB ^Pb) = VV {pa XaxbPb) = VIAta®tb = 0, 
and therefore 

Vy{^Pa-X) = Vy^Pa-X+ VpA ■ VyX = Q-X+ V pa -0 = 0. 
For the main calculation, 

^,AS : (( VpA • X) ®AxB {"^Pb ■ Y)) 
= (V'.As 'p'gTB ^ Pb -taqtb Y^ 'p'aTA ^ Pa -taqtb X 
= ^V^^'^'^^^'^V.a) 'p'a^a Pa -tabtb X 
= Vt (V-.A • VPA • X) - V,A • V?.^ ( VPA • X) 



(since ^ pa ■ Y = 0) 
(by reverse product rule) 
(since Vy ( Vp^ • X) = 0), 



as desired. 



□ 



36 



Part II 

Riemannian Calculus of Variations 



The use of the Calculus of Variations in the Riemannian setting to develop the geodesic equations and to 
study harmonic maps is quite well-established. A more general formulation is required for more specific 
applications, such as continuum mechanics in Riemannian manifolds. The tools developed in Part I will now 
be used to formulate the first and second variations and Euler-Lagrange equations of an energy functional 
corresponding to a first-order Lagrangian. In particular, the bundle decomposition discussed in Section 9 
will be needed to employ the standard integration-by-parts trick seen in the formulation of the analogous 
parts of the elementary Calculus of Variations. The seemingly heavy and pedantic formalism built up thus 
far will now show its usefulness. 

In this part, let {M,g) and {S,h) be Riemannian manifolds with M compact. Calculations will be done 
formally in the space C°° (Af , S) , noting that its completion under various norms will give various Sobolev 
spaces of maps from M to S, which are ultimately the spaces which must be considered when finding critical 
points of the relevant energy functionals. See [4, ")] for details on the analytical issues. Let dVg denote 
the Riemannian volume form corresponding to metric g, and let dV g be the induced volume form on dM . 
Let l: dM — )■ M be the inclusion, and let v e T {l*T* M) be the unit normal covector field on dM . Let 
E := TS ®sxM T*M and tt :— iVg^ 'SSsxm ""m making n: E S x M a, vector bundle. 

The energy functionals in this section will be assumed to have the form 



where L: E ^ W, referred to as the Lagrangian of the functional, is smooth. Here, V (f) could be understood 
to take values either in E = TS(^sxmT*M or (j)*TS ®mT* M . In the former case, the composition Lo V0 
is literal, while in the latter case, there is an implicit conversion from (t>*TS ®m T*M to TS (E)sxm T*M via 
a fiber projection bundle morphism (see (6.2)). Either way, L o V 0: i\/ ^ M. Let V"^"^ and V"'"*^ denote 
the respective Levi-Civita connections, which induce a covariant derivative on E (see (8.9)). Define the 
connection map w € F {n* E ®e T* E) using as in (9.2). For convenience, the S x M subscript will be 
suppressed on the "full" tensor product defining E from here forward. 

11 Critical Points and Variations 

One of the most pertinent properties of an energy functional is its set of critical points. Often, the solution to 
a problem in physics will take the form of minimizing a particular energy functional. Lagrangian mechanics 
is the quintessintial example of this. This section will deal with some of the main considerations regarding 
such critical points. 

Because the domain of a [real-valued] functional C may be a nonlinear space, the relevant first derivative 
is the [real- valued] differential dC, which is paired with the linearized variation of a map </> e C°° {M, S). In 
particular, a one-parameter variation of is a smooth map $ : M x / — > S*, where the / component is 
the variational parameter. Letting i denote the standard coordinate on /, the linearized variation is then 
(5i$: M TS, recalling that Si := ^ |i=o- Because oSi^ = (j), it follows that Si^ e F {(j)*TS), i.e. Si^ is 
a vector field along cj). The object (5,;<i> will be called a linearized variation. Call the elements of F {<j)*TS) 
linear variations. 

Proposition 11.1 (Each linear variation is a linearized variation). Let cxp: U ^ S denote the exponential 
map associated to V^'^, where U C TS is a neighborhood of the zero bundle in TS on which exp is defined, 
and let A: TS x M — > TS, (s, e) i— > es denote the scalar multiplication structure on TS. If A Cz T {(j)*TS) and 



C:C°^{M,S) R, 
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if^:U^Sis defined by $ :— expoAo (A x Id/) then 5^$ = A. In other words, every vector field over 
<j) is realized as the linearization of a one-parameter variation of (j). 

Proof. The map $ is well-defined and smooth by construction. Let p G M . Then 

= 5i (exp oA o (yl X Id/) (p, i)) 
= V exp-(5,(A(A(p),i)) 
= V exp-(5, (iA(p)) 

= V exp. -^(P) 

where Z {tt* E) denotes the zero subbundle of i:*E. The last equality follows from a naturality property of 
the exponential map [ , pg. 523]. □ 

Thus each linear variation is a linearized variation, establishing a natural identification of {C°° {M, S)) 
with r {(f)*TS), which will be useful when calculating the differential of a functional on C°° (M, S). In fact, 
the exponential map construction in (11.1) is a way to construct charts for the infinite dimensional manifold 
C°° iM,S) Theorem 5.2]. 

12 First Variation 

This section is devoted to calculating the first variation of the previously defined energy functional. Here 
is where the full richness of the type system of the objects developed earlier in the paper will really show 
their power (and arguably, necessity). While the type-specifying notation may appear overly decorated and 
pedantic, subtle usage errors can be detected and avoided by keeping track of the myriad of types of the 
relevant objects through the sub/superscripts on covariant derivatives and natural pairings; extremely com- 
plex constructions can be made and navigated without much trouble. By contrast, performing the ensuing 
calculations in coordinate trivializations would result in an intractible proliferation of Christoffel symbols 
and indexed expressions which would prove difficult to read and would be highly prone to error. 

Because the Lagrangian L : — M is defined on a vector bundle tt: E ^ S x M over the product space 
S X M , the decomposition in (9.5) can be slightly refined. The projection tt can be decomposed into the 
factors TTs '■— Pi'f^^"^ ^^nd t:m :— pr^^*^ ott, so that tt = tt^ x^; ttm- Then h = V tt = V Trg ®e V ttm- 
Let 

CT V TTs e r {tt*sTS ®e T*E) and ii := V ttm e T {ttIjTM ®e T* E) . 
The letters sigma and mu have been chosen to refiect the fact that L^^ G L {■K*gT* S) and L.^ e F [tt^jT* M) 
give the "S* component" (spatial) and "M component" (material) of the derivative V^^^L e F {T*E). The 
connection map v will be retained as is, giving L^y g F {tt*E*), the "i? component" (fiber) of V^~*''*L. See 
(12.5) for a discussion of how the quantities L.^, L g., L,v generalize the analogous structures in the elementary 
treatment of the calculus of variations. 

Because a one-parameter variation of e C°° (M, S) has the form <I> : M x / — > 5 but the energy 
functional C involves only the M derivative of its argument, the partial tangent map must be used here. For 
the purposes of calculating the first and second variations, C must be written as 

£(0) := / Lo,p^MdVg. 

Theorem 12.1 (First variation of £). Let L, L, a, jj,, v and v all he defined as above. If 4> ^ C°° {M,S) 
and A e F(0*TS'), then 

(i£(0)-A= / A-^.T*s {4>*ML,a - divM {(I)*mI^,v)) dVg + / A -^^t'S 4>*mL,v -T'M v dV g. 

JM ' ' JdM 
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The expression above is often called the first variation of fL. A type analysis here gives (p^^j^jL^^ G 
r {(j)*T*S) and (jj^jL^,, € T {4)*T*S®m TM). Recall that because the domain of </) is M, V (j) = (I)^m- 

Proof. Supporting calculations will be made below in lemmas. Let $ : Af x / — > S" be as in (11.1), so that 
(5i$ — A. For tidiness, let L^o- := (jfj^L^„ and L^.„ := (f>*j^iL^y. Then 

d£ {(I)) ■ A ^ dC {(l>) ■ 6^<i> 



M 



5, [L o $^m) dVg 



- / L,, -^.Ts A + L,„ -^.ts^mT-a/ V^'^^AdVg (by (12.2)) 

J M 

^ A -^.T's (L,^ - divM L.y) + divM [A -^-^t-s L,y) dVg (by (12.2)) 

J M 

= I A -^.T'S (L,o- - divM L^y) dVg + / A -^^t'S L,y -T'M i^dV^g (divergence theorem), 

Jm JdM 

as desired. 

As for the types of (f>*j^jL a- and cffj^jL y, the contravariance of bundle puUback allows significant simplifi- 
cation. Because L,^ e T {'K*gT* S) and L_,, e r(7r*_E*), 

Ca/^,. e r (0*jv/^Sr*5) = r ((tts o 0^m)* t*s) = r (0*t*5) and 

(t)*ML,y^T{(t)*MTT*E) ^T{{-KO(f)^M)* {T*S®TM)) = T {4>*T* S (E)m TM) . 



The supporting calculations follow. Define z: M ^ M x I, to i-> (to, 0) for purposes of evaluation of i = 
via precomposition as in (10.6). Then Si is a section of a puUback bundle; 6i = z*di E T {z* {TM TI)). It 
should be noted that $ o 2 = by definition, and that z*^^m ~ (z*$) = (t>,M by (10.6). □ 

Lemma 12.2. LetL, $, yl, cr, andv be as in Theorem 12.1. The variational derivative of Lo^ j^j decomposes 
in terms of the partial covariant derivatives L ^ and L,„ and the linearized variation A; 

Si [L o $_m) = 4>*ML,a ■4,'TS ^i"^ + (t>*ML,v •(0XmMm)*E V"*5i$. 

T/ie integration-by-parts trick as in the derivation of the first variation in elementary calculus of variations 
generalizes to the covariant setting; 

l',cr -cp'TS ^ + -^'ts^mT'm V^A = A -^-T'S (L^o- — divjv/ L_„) + divjv/ {A -^'T'S L,«) . 
Proof. A wonderful string of equalities follows. 

(5, (L o $^m) 

= z*V*^><^^« (L o $^m) -..(TA/eT/) 5» (here, 5, = z*d,) 

= z*$:^,V^^«L -..^..^^^TB ^* V ^'""'^^^M -z'iTM^Ti) (chain rule) 

= <;**Af (■^,<T -TrjTS + vj.TA/ M + -^,t, 'Tr-i; ■0;-j„T£; Si^^M (by (9.4) and because o z = (/)_j,,^) 

= ^*mL,^ -^.ts + (t>*ML,y •(0XMid„)*iJ V^^,$ (by (12.3)) 
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Note that by (6.3), '^*mTt*sTS = (tts o <^>^m)* TS = <P*TS, ^^^t^m™ = i^M o <t>MT TM = {prl'^''^)* TM 
and $\^7r*i? = (tt o $,m)* = ($ x mxi P^m^^)* -E- Replacing 5^$ with A gives 

(i o ^^m) = L,<T -^'TS A + -cf'TS^MT'M V^A, 

establishing the first equality. 
For the second. 

L,(T -ip'TS A + L^y -(/i'ts^mT'M V^A 

= L^CT -^'TS A + try* A/ {L,v '(p'TS 

V^A) (t racing TMseparately) 

= A -^.T'S L,a + ti-T'M (V (L,„ -^.TS A) ~ (V^^'^^^^L,^) -^-ts ^) (reverse product rule) 

= A -^.T'S L,<T - A -^.T'S tTT'M V'^'^^^^^'L^y + trT-Af V {A -^^t-s L,t,) (^.Tscommutes with trr-u) 

= A -^'T'S (L.(T - divA/ L_t,) + divAf (A -^'T'S L,t;) (definition of divA/). 

Note that L,^, e F (0*T*S' (g)Af TM), so divA/ L,„ e F {(f>*T*S) and A -^.t-s L,„ e F (TM). □ 
Lemma 12.3. The variation Si^,M decomposes as follows. 

<I^*M'^ ■r,,TE k'^M = <5,$ e F {^*TS) , 
<t>*M^' ■r,,TE k'^M = e F (tm) , 
•^^A/^^ -r^TiJ k'^M = V^'^^5,$ e F {^*TS ®M T*M) . 
Proof. Fhis calculation determines the cr component of (5i$.A/. 

= Si (tts o ^^m) 

= <5,(prf *^o^o$m) 

= (5,$ e F (z*$*TS') ?s F {<j)*TS) . 

Fhis calculation determines the ^ component of Si^^M. 

'/'*AfM ■'P'mTE Si^^M = 0*Af V TTAf -cli'j^jTE k^ ,M 

= (ttai o $,aj-) 

= 5, (prf/^^o^o$,M) 

= e F (z* (prirO* ^^^) - r (J^Af) . 

Fhe last equality follows from the fact that prj^J^^ does not depend on the i coordinate. 

Fhis calculation determines the v component of (5i<i>,A/. Let pM '■— P'^m^^ Pi pr*^^^- The 
left-hand side of the third equality claimed in the lemma will be examined before evaluating at i = 0; 
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Let r e r (TM), noting that pljY € T {pljTM). Then 
{^*M^ ■'fyiTE 9.<$,A/) -pi^TAipliY 

= ('i?.*// -p'TI P*ldi) -pljTM P*mY 

= (*,/M p'.ta/Pm^) ■p',TiP*A (by (10.3)) 

= -p-TI P*A) -pl.TMPllY ~ $,/ -pjT/ ((pl9i),M -pI^TAf Pm^) 

= (9i$) 'pIjTM P*mY (smce pjdidoesn't depend on M). 

Recall that Id a/ — pm ° z and that the puUback of bundles is contravariant. Then evaluating at i = via 
puUback by z renders 



( 



4'*m'" '4>'m'^E Si^,Mj -TM y 

= {{^M o z)* V -{^ ^jozyTE z*9i$,A/) -{pMozYTM {pm o z)* Y 

= {z*^*mV ■z'<i'j^,,TE Z*di^^M) -z'pIjTM z*plfY 



= {{'^*M^ ■'fyiTE a»$,A/) -pl^TM pllY^j 
= Z* ((9^«')_M -pl^TMPllY^ 
= Z* {di^),M -z'pl.TM Z*P*mY 

= {z*d,<^) j^j -(pMozYTM {pm ° z)* Y (by (10.6)) 

= (5i$)_j,j -TM Y 
= W^'^^S,<i> -TM Y 

The last equality is because e r{(f)*TS), which is a bundle over M, and therefore (6i^) is the 
total covariant derivative. Because Y is pointwise-arbitrary in TAI, this implies that 0*^^ '(p'^jTi Si^.M = 
y0 TS^^(j)^ j |-]^g variational derivative 6i commutes with the first material derivative, just as in the 
analogous situation in elementary calculus of variations. □ 

Corollary 12.4 (Euler-Lagrange equations). // e C°° ( A/, 5) is a critical point of C (i.e. if dC {(f)) ■ A ~ 
for all A eT{(t)*TS)), then 

(f>*ML,a - divAf {(t>*ML^y) = on M, 
c/i^jL^y -TM V — Q on dM. 

These are called the Euler-Lagrange equations for the energy functional C. Recall that because the domain 
ofcj) IS M, '7(f> = (f>M- 

Proof. This follows trivially from (12.1) and the Fundamental Lemma of the Calculus of Variations [ , pg. 
16]. □ 

It should be noted that the boundary Euler-Lagrange equation is due to the fact that the admissible 
variations are entirely unrestricted. If, for example, the class of maps being considered had fixed boundary 
data, then any variation would vanish at the boundary, and there would be no boundary Euler-Lagrange 
equation; this is typically how geodesies and harmonic maps are formulated. 
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Remark 12.5 (Analogs in clcincntary calculus of variations). The quantities L.^^, L,o-- -^.r generalize the riuan- 
tities J If J §1 respectively of the elementary treatment of the calculus of variations for energy functional 



(/:f/^Mn)^ f Lix,f{x),Df{x)) 
Ju 



dx, 



where U C is compact and ?7 x M" x K^^" 9 [x, z,p) ^ L (x, z,p) is the Lagrangian. Here, ^ : U 



X 



dL 



, -g^-.UxWx K™^" ^ M", and II : X K" X M™^" ^ M™^" decompose the total 



derivative dL and are defined by the relation 

,r / X / \ , . dL , , i9L , , 

dL (a;, z,p) ■ [u, v,w) = — [x, z,p) ■ u + — [x, z,p) ■ v + — {x, z,p) : w 

for u e R™, e M", and w e R™^". The Euler-Lagrange equation in this setting is 

BL dL\ 

— - dive/ — j (x, f (x) , Df (x)) = for a; € C/, 
noting that the left hand side of the equation takes values in R". 



In most situations involving simpler calculations, it is desirable and acceptable to dispense with the 
highly decorated notation and use trimmed-town, context-dependent notation, leaving off type-specifying 
sub/superscripts when clear from context. 

Proposition 12.6 (Conserved quantity). If M is a real interval, (p G C°° (M, S) satisfies the Euler-Lagrange 
equation, and L^^ = 0, then 

H:={^cj>)* L,, ■4,,TS^^T'M V <^ - ( V 0)* L e (M, K) 

is constant. If L is kinetic minus potential energy, then H is kinetic plus potential energy (the total energy), 
and is referred to as the Hamiltonian. 

Proof. Let t be the standard real coordinate. Note that because M is a real interval. 'V cj) = cj)' ®m dt. 
Terms appearing in the derivative of H can be simplified as follows. Note the repeated V derivatives; 
V M (j)*TS(^M T*M but VV ?!>: M ( V ^ii)* T {cjfTS (^m T*M) lands in a higher tangent space. 

(V0)V - VV(/.- I = (V<^)* Vtts- VV0- ^ = |(7rso V</>) =0', 

\ -r, ^ dt ^ ' dt 

V. (V</))*L = (V(/.)*Vi: - V v</.-4 
^ dt 

= ( V 0)* L,„ -cp' + {V4>)* L^y • V 
V_d ((V0)*L„ : V0) = (V0)*L,^ : {(j)' ®m dt) + {V (/))* L^^ ' '^ai^'^ 
= (v i"^ (j))* L ^ ■ dt) •(/>' + (V(/))*L^: Va.V(?!-. 

\ dt J dt 

Again, because M is a real interval, the divergence is just the derivative, so the Euler-Lagrange equation is 

O = (V0)*L,-divM((V0)*L,) 
= (V(/.)*L<.-V(V</))*i„ : (^dt<»M 
= L^„-S7a. {V (j))* L^^-dt, 
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and therefore W jl {V (/))* L ^ ■ dt ^ {V <j))* L,„. Thus 

Vj_H^Vjl ((V0)*L„ : V0 - (V0)*L) = (v j_ {V cf))* L ^ ■ dt ~ {V cf))* L ,„\ ■ cf)' 

dt dt ^ ' ^ \ dt ' ' / 

which is zero because (f> satisfies the Euler-Lagrange equation. This shows that H is constant along sohitions 
of the Euler-Lagrange equation, and is therefore a conserved quantity. It should be noted that this proof 
relies on the fact that the divergence takes a particularly simple form when the domain M is a real interval; 
the result does not necessarily hold for a general choice of M. □ 

Example 12.7 (Harmonic maps). Define a metric 

k£r{E* ®sxM E*) ^ r {{TS T*M) 05XA/ {TS T*M)) 
in a manner analogous to that in (3.4); 

k ■.= hMg~^. 

To clarify, h® g^^ £ T {{T* S ®s T* S) ® {TM ®m TM)), so permuting the middle two components (as in 
the definition of hM g^^) gives the correct type, including the necessary metric symmetry condition. If 
A E, then \A\i^ is the quantity obtained by raising/lowering the indices of A and pairing it naturally with 
A. A useful fact is that Vfc = 0; if u ® w £ TS ® TM, then permutation commutativity (8.12) and the 
product rule gives 

V„e^fc = V„e^ (h M g-^) = V„/i M g-^ + hM Vyg-\ 

which equals zero because h and g^^ are parallel with respect to V^'^ and V"^ *^ respectively. 
With Lagrangian 

L:E^R,A^^\A\l 

and energy functional 

£{(t)):= I Lo V(f>dVg 
Jai 

{£ {(j)) is called the energy of </>), the resulting Euler-Lagrange equations can be written down after calculating 
and It is worthwhile to note that L is a quadratic form A i-^ A : ^k : Aon E, which will automatically 
imply that L ^ (A) — A : k. However, the calculation showing this will be carried out for demonstration 
purposes. 

Let A,B ^TS® T*M. Then e i-^ ^4 + eS is a vertical variation of A, since h{5{A + eB)) = 0, so 

{A):B = {A):v 5^ {A + eB) 
= (5e {L{A + eB)) 

= ,5, (^{A + eS) : i {^*k {A + eB)) : [A + eB)^ . 

The product rule gives three terms. The middle term is zero because tt {A + eB) — tt (A), and therefore does 
not depend on e. The basepoint evaluation notation for TT*k {A) will be suppressed for brevity (see Section 
5). Thus 

Ly(A) : B = B -.-k: A + A:-k: B = A:k: B, 

where the last equality results from the symmetry of k. By the nondegeneracy of the natural pairing on 
TS (E) T*M (which is denoted here by :), this implies that L^^ (A) = A : k. 

To calculate L^^, it is sufficient (and can be easier) to calculate L^h, as h = V tt, tt = tt^ ttm, so 
h = a ®E A*- Let A (e) be a horizontal curve in E = TS (g) T*M; this means that v ■ ^A = 0. Recall that 
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V- f^Ais defined by V^r^^'^A. Then 

L,h {A) -Tr'iTSfBTM) ^ 'TE SeA = (L_^ ^•(TSeTA/) ^ + 'tt'E v) 'TE ^hA 

= VL -TE SeA 

= 5,iLo A) 

^S,(^A: ^A*n*k : A^ . 

As before, the product rule gives three terms. Using the contravariance of bundle puUback, the middle term 
is 

l^(.oA)-(£*»sxMiJ-) „ ^ 1 „ vB-«?sxMiJ-fc .6,(7:0 A), 

which equals zero because Vfc = 0. Thus 

Lj, {A)-h-S,A^Vs^A:^k:A + A:^k: Vg.A, 

which equals zero because V^^^ = v ■ S^A = 0. The quantity h ■ S^A can take any value in tt* {TS © TM), 
showing that L^h = 0. Finally, h = a (Be fJ' implies that L a- = and L.^ = 0. This can be understood from 
the fact that L depends only on the fiber values of A, and has no explicit dependence on the basepoint; this 
relies crucially on the fact that V/c = 0. 

Finally, the Euler-Lagrange equations can be written down. Recalling that the natural trace of a tensor 
(used in the divergence term in the Euler-Lagrange equation) is contraction with the appropriate identity 
tensor, let (ci) be a local frame for TM and let (e') be its dual coframe, so that (^m is a local expression'"' 
for IdxM G r {TM (g)M T*M). The type-subscripted notation will be minimized except to help clarify. On 
M: 

0= (V(^)*i,-divM((V(^)*L,) 
= - tr V (V : fc) 

= -Ve. ( V^-.k) -T'Me" 

= -Ve^ V(/) -.k-e'- V (/) : Ve^fc • e*. 

The second term vanishes because Vfc — 0. Unraveling the definition of fc gives Ve^ V cf) : k — (f)*h ■ Ve^ V (p ■ 
g^^. Contracting both sides of the above equation with —(j)*h^^ gives 

Ve, V • g-^ . = Ve. V • e, = tr^ V^^ e F (</)*TS') . 

The quantity tr^ V'^cj) is the g-trace of the covariant Hessian of (f) and can rightfully be called the covariant 
Laplacian of (j) and denoted by Ag(/) (this is also referred to as the tension field of (f) in other literature 
[2 , pg. 13], which is denoted r (0)). Note that Ag0 is a vector field along 0. This makes sense because 
is not necessarily a scalar function; it takes values in 5*. In the case S" = M, Ag0 is the ordinary covariant 
Laplacian on scalar functions. 

A harmonic map is defined as a critical point of the energy functional £(0) :— Jj^^ ^ I V^lfe dVM- 
Assuming a fixed boundary (so that the variations vanish on the boundary) eliminates the boundary Euler- 
Lagrange equation, the remaining equation is 

Ag(f) = on the interior of M, 

which is the generalization of Laplace's equation. Satisfying Laplace's equation is a sufficient condition for a 
map to be a critical point of the energy functional. There is an abundance of literature concerning harmonic 
maps and the analysis thereof [2, 6, 14, 20]. 

''It should be noted that while Utm is being written as the local expression e*i no inherently local property is being 

used; this tensor decomposition is only used so that the product rule can be used in the following calculations in a clear way. 
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Example 12.8 (The geodesic equation). A fundamental problem in differential geometry is determining 
length-minimizing curves between given points. If M is a bounded, real interval, and t denotes the standard 
real coordinate, then the length functional on curves 0: M — >■ 5* is £ {(j>) := Wig dt. A topological metric 
d: M X M ^ M. on M can be defined as 

d {p, q) := inf {C {(f>) \ (j) joins pto q\ . 

It can be shown that the length functional C {(f>) := Jj^j Wlh dt and the energy functional £ (</>) :— Jj^^ ^ |(/)'|^ dt 
have identical minimizers. Note that (/>' £ r((f)*TS). It is therefore sufficient to consider the analytically 
preferable energy functional. 

In this case, the metric g on AI is just scalar multiplication on M. Because M is one-dimensional and t 
is the standard real coordinate, is a global, parallel orthonormal frame for TM, and the g-trace of 
(i.e. Ag(f>) has a single term. The Euler-Lagrange equation, on the interior of M, is 

= A„0 = tr„ = VV(/) : (4,-T-|==VdV0--^==Vd lV</---f-|-V0-Vd-^. 

^ ^ ^ \dt dt) ^ ^ dt ^ \ ^ dt J ^ ^dt 

But V • ^ = 0' and jl^ — 0, giving the geodesic equation 

'^'tt^^'t'' ~ on the interior of M. 

dt 

This is the covariant way to state that the acceleration of (j) is identically zero. The geodesic equation is 
commonly notated as = V^i(j)', though such notation is inaccurate because cf)' is not a vector field on S, 
but a vector field along 0, and therefore use of the puUback covariant derivative V*^ '^^ is correct (see (8.8)). 

While formulated using fixed boundary conditions (0 has p and q as its endpoints), the geodesic equation 
is a second order ODE for which initial tangent vector conditions are sufficient to uniquely determine a 
solution. 

13 Second Variation 

A further consideration after finding critical points of the energy functional C is determining which critical 
points are extrema. This will involve calculating the second derivative of £. Let C := C°° (M,S), noting 
that T^C = r {(p*TS) for (p G C. The first derivative of £ is V'^~*'^£ :— d£, as seen in the previous section. 
The second derivative is the covariant Hessian V"'" "^V*^^"/^, where the covariant derivative V"'" is induced 
by V^'^ [5, Theorem 5.4]. 

For the remainder of this section, let /, J C M be neighborhoods of zero, let i and j be their respective stan- 
dard coordinates, and extend the existing (5-style derivative-at-a-point notation by defining 5i :— ^ \i=j=o, 
5j := |i=j=Oi SiJid evaluation map z : M ^ M x I x J, m i-^ (m, 0, 0). Then Si = z*di and 6.j = z*dj] these 
will be used as in the calculation of the first variation. 

Theorem 13.1 (Second variation of £). Let £, L, a, [i, v and v all he defined as above. If (/) E C°° {M,S) 
is a critical point of £ and A, B E T^C = F {(j)*TS), then the covariant Hessian of £ is 

V^£{^) -.T.c {A(g>B) 

A -c/j'T'S 'py.lL.cra 'ip'TS B + A -^-T'S 4'*mL,(tv ' (p'TS® mT* M V*^ B 

+ '^^ A -(ft'T'S^MTM 0*Af i,-u(T -^'TS B + "^^ A -^^'-T'S^mTM 0*M^,-"f 'tjj'TS^MT'M B 
— A -^'T'S {4>*mL,v -^i'TS^mT'M {(j)* R^^ -(P'TS 0,A/)) -(fi'TS B dVg. 

This is often called the second variation of £. Here, i?^"^ e T (TS <Sis T* S <Sis T* S <Sis T* S) denotes the 
Riemannian curvature endomorphism tensor for the Levi-Civita connection on TS. 
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Proof. Let <I> : MxIxJ ^ 5 be a two-parameter variation such that = A and 5j<^ = B (e.g. $ (m, i, j) := 
exp {iA (m) + jB (m))). The variation $ can be naturally identified with a variation <i> : / x J — )■ C, («, j) 
(to I— >■ $ (to, j, j)) which is more conducive to the use of C as a manifold. The tensor products in the generally 
infinite-dimensional TC are taken formally. Let z := (0, 0) G / x J. 

By (10.4), taking the algebra formally in the case of infinite-rank vector bundles, 

(£o$) =$*v2£ ij.j^c- (v$H/x,/ v$) + $*v/: -j-y^^ VV$, 



{V^C oc 0) :t^c {S,'^ ® -^j^) + (V£ 

(£ Oc $) :T/eTj ((^i «)/x J ^j) 

[ 6,d,iLo<i>^M) dVg 
Jm 

/ (Lo $.m) -TMfSTI^TJ (^i ®M-kI-kJ ' 



(since V£ = 0) 
(by above) 



5,) dV, 



A-^.T 



M 



*TS B + A -Aj'T-S 



'TS 



B 



BdVn 



(by Calculation (1)). 



+ V"^ '^^ A ■4,'-T'-S(S)mTM 4'*mL,vv -^i'TS^mT'M V"^ ^'^B 
— A -^'T'S {4'*mL,v ■4>'TS(iS,mT'M {(j)* R^^ -(P'TS 4',m)) -tti'TS 

Supporting calculations follow. 

Calculation (1): Abbreviate <j>*]^,jL xy by li.xy By (10.4), 

{L o $^A/) :tM(STI(BT,J {^i ^MxIxJ Sj) 

M 



(by Calculation (2)) 



(by Calculation (3)). 
Note that Vf'^'^a^* e T {4)*TS), and since ^ is a critical point of £ 



J M 
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Thus 



/ (L o -TMermTJ {Si 'SiMxixj Sj) dVg 
J M 

JM ' ' • \ J 

+ -^T^S -r^TT-B -^.TS (5^$) -^.TS 0,^/)) dK, 

/ A -cfi'T'S 'i^,a<T -(ji'TS B + A ■^•'T'S L,cru -(ji'TS^MT'M V*^ "^"^-B 
J M 

+ V"* "^■^A ■(P'T'S^mTM L^uct -^'TS 5 + '^^ A ■(P'T'-S^mTM •^•-TS(g)MT'M V* '^'^B 

— A -r/j'T'S ■^*TS(»mT'AI {(jj* R^^ -cji'TS 4>.m)) '4>*TS B dVg 

(by antisymmetry of curvature tensor). 



Calculation (2): 



Z* VV :z*(TAfeT7eTJ) (<5i (8)M <5j) 

""-^^ ' V 3>,M -TMeT/eTJ (,Ci (K)Mx/x J Ojj J 

*\,Ti5(»Mx/xj(T*MeT*/er*,/) 
= V J""'"' ( V <I>,M -TMer/eTj 5,) (since V™®^^®^'^a, = 0) 



Calculation (3): As calculated in the proof of (12.1), 

</':Af^ ■r^.TE <5.$,M = <5,$ e r {(b*TS) , 

'/':mA^ -CmTB '5.$,m = e r (TM) , 

Cm^' -r^TB S,<fM = V^*™,5,$ e r {(l)*TS ®M T*M) . 

Furthermore, letting P :— prj)^^^^'' for brevity and noting that P o z — Wm, 

= 2:*V J"''^^^ •<i.:„T£; a,$,A/) (since Vcr = 0) 

= 2*V|,^^°*'"'^'^^5,$ (using calculation from (12.1)) 

= v|.*'^^a,$ e r (z*$*r5) = r {<j>*TS) , 
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= z*v'^''''^'^^' (ci>;,,^ -^y^TE 9.$.A/) (since = 0) 

= ^*y(^^M°'i'.M)'TAfQ ^^^.^^g calculation from (12.1)) 

= e r (z* (ttm o $^m)* tm) = r {z*p*tm) = r (tm) , 

= 2*V^'^'''' ^ ($*mW -^^tb 9,;$,m) (since Vv = 0) 

— z*Vg^°*'*^^ ^ (di^) ^ (using calculation from (12.1)). 

Note that 

Cm« e r {<j^*,MT:*E ®M ^*mT*E) = V ((0 Xm Mm)* E ®m ^^T* E) , 



and therefore 



*Af« -^VTS V*.*"^^a,$,M e r ((0 Xm Mm)* ^) - T {<i}*TS ®m T*M) , 



so it suffices to examine its natural pairing with TM elements. Let X G T {TM), noting that X = ld\j X 
z*P*X and that P* X = TP-{X® On © Otj) € T {P*TM). Then 



X 



— 2: V^^. {Oi^} ]^^ -z'P'TM Z f A 



- z* ((a,$)_j,, • V^;^*^ ( V P -TMeTieTJ ■ {X © Ot/ © Otj))) 

^* (vS.eo... V|*^^a.$ + V|7ieo„®o...]5.* - ^^'"^^ (5„ ^ ® Ot/ © Ot/) 5,$ 



= z 



- z* |^(Id$. TS ®Mxixjdi^) ■ii.'TSi^Mxiyj'^'T'S -R* '^'^ :tm®t/®t/ [dj '^Mxixj {X © Ot/ © Ot/))) 
V**^^Vf7'^9,$ + (Id^.TS ^M'J.^) ■4>'TS^MrT'S {^*R^^ -rrs S^^) -^^ts <I^,m 



■TM 



X, 



where the last equality follows from Calculations (4) and (5). Because X is pointwise-arbitrary in TAf, this 
shows that 



Calculation (4): 



V ^ / ,A/ 

(Vf7^9,<l>) -TM X (by (10.6)) 

^^'ts^I^'tsq.^ -ta/ ^ (because V|*^^a,$ € T (z*$*TS') T {(l)*TS)). 
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Calculation (5): 

— Z* i^R'^ '^^ -TMiSTlmTJ {dj ®AixIxJ (X © Oti ® Otj))^ 
= z* (i?*'^^ ■■TAmTKBTj {{X © Oti © Otj) ®mx/x,/ a^)) (antisymmetry of i?**^^) 

= Z* :$.TS (V$Hmx/x.7 V$) iTAfeT/eTJ ((^©0T/®0Tj)®Afx/x,7a,)) (by (10.5)) 

= Z* :$.TS ((*.M -P'TAf ^'*^) «)Afx/xJ dj<^)) 

= Z* -^.-TS dj<S>) ■<s,*TS -P'TM P*X) 

= -..^.TS 2*5,$) -..^.TS 2**,Af -z-p.TAf 2*P*^ 

= [(l)*R^^ •^"TS ^j^) -(jy'TS 4>M -TM X. 

□ 

Theorem 13.2 (Second variation of C (alternate form)). Let C, L, a, ^, v and v all be defined as above. 
//(^ e C°° (M, S) is a critical point of C and A, J5 e T {cj)*TS), then 



V2£(0) :t,c {A®B) 

J M 

— A -^'T'S divAf L^„cr -^'Ts B — A -^'t-'S ^.va ■T'-M®M<t>'TS (v*^ B^ 



— A ■^•'T'S divj\,/ L_„„ ■^'ts<»mT'm V"^ '^'^i? 

— A ■cjj-'T'S -tp'TS^MT'M {(t)*R^^ ■<j,'TS 4>,m)) ■4>*ts B dVg 
+ / (A •0*T*S L^tjCT -^-TS S) -T-Af + ( A ■0*T'S •^•rS(»MT*A/ 

Proof. This result follows essentially from (13.1) via several instances of integration by parts to express the 
integrand(s) entirely in terms of A and not its covariant derivatives. Abbreviate (j)*j^jL^xy by L ^.y. Then, 
integrating by parts allows the covariant derivatives of A to be flipped across the natural pairings over (f)*TS. 

y0 -(ft'T'S^MTM L,t)o- -(/I'TS B dVg 

M 

J tr^Af (^(y^ ^^A^ ■^--'TS L,^cr ■^--'TS dVg (TMtrace is taken separately) 

/ tr-TAf (V'^^"' {A -^-T'S L,„o- -(ji'TS B)) 

J M 

— tTTM {a -r T'S V^*^'^«-^*^«-^-^^L,„. .^.TS b) 

— tiTM (^A ■4,*T'S L,i,<T ■4>'TS V^'^-^B^ dVg (reverse product rule) 

/ —A -(ji'T'S divM 'Lt^va '(p'TS B (definition of divergence) 

Jm 

( <h'TS 

— A -cfi-T'S L,^o- •T*M0M<P'TS B 1 dVg 

+ {A -tji'T'S L,„a -^'TS B) -T'M I' dV g (divergence theorem). 

JdM 
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Similiarly, 



y(/> TS^ -^'T'S^mTM L.du -(P-TS^mT'M V*^ B dVg 

M 



= J^^^'TM (^(V**^^^)^''^ ■rT's'L,vv -rrs^^MT-M V**^^B^ dVg 

i-y. (a Y7cl>'T'S(giMTM«,A4<p'T'S(»MTM-T Y7't>'TS r,\ 

— trTM I ^ ■4,'T-s ^ i^,vv ■4>'TS'»mT'M y ts \ 

f 

= / -^-T'S diVM L,Dt) ■rp'TS(»MT' M V'^ ^^B 

J M 

JdM ^ ' 

Together with (13.1), this gives the desired result. □ 

14 Questions and Future Work 

This paper is a first pass at the development of a strongly-typed tensor calculus formalism. The details of 
its workings are by no means complete or fully polished, and its landscape is riddled with many tempting 
rabbit holes which would certainly produce useful results upon exploration, but which were out of the scope 
of a first exposition. Here is a list of some topics which the author considers worthwhile to pursue, and 
which will likely be the subject of his future work. Hopefully some of these topics will be inspiring to other 
mathematicians, and ideally will start a conversation on the subject. 

• There refinements to be made to the type system used in this paper in order to achieve better error- 
checking and possibly more insight into the relevant objects. There are still implicit type identifications 
being done (mostly the canonical identifications between different puUback bundles). 

• The calculations done in this paper are not in an optimally polished and refined state. With experi- 
ence, certain common operations can be identified, abstract computational rules generated for these 
operations, and the relevant calculations simplified. 

• The language of Category Theory can be used to address the implicit / explicit handling of natural type 
identifications, for example, the identification used in showing the contravariance of bundle puUback; 
il,*(j)*F^ (^oi/')*F. 

• The details of the particular implementation of the puUback bundle as a submanifold of the direct 
product M X F are used in this paper, but there is no reason to "open up the box" like this. For most 
purposes, the categorical definition of puUback bundle suffices; the puUback bundle can be worked 
exclusively using its projection maps Trfj^ and Pp^. In the author's experience (which occurred too 
late to be incorporated into this paper), using this abstract interface cleans up calculations involving 
puUback bundles significantly. 

• The type system used for any particular problem or calculation can be enriched or simplified to adjust 
to the level of detail appropriate for the situation. For example, if 7 G C°° (M, Af), then V 7 £ 
F (7*rM (XiR T*M), but if t is the standard coordinate on M, then V 7 = -/''S)mdt, where 7' e F {j*TM) 
is given by V 7 • J^. This "primed" derivative has a simpler type than the total derivative, and would 
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presumably lead to simplier calculations (e.g. in (12.6). This "primed" derivative could also be used in 
the derivation of the first and second variations. While this would simplify the type system, it would 
diversify the notation and make the computational system less regularized. However, some situations 
may benefit overall from this. 

• The notion of strong typing comes from computer programming languages. The human-driven type- 
checking which is facilitated by the pedantically decorated notation in this paper can be done by 
computer by implementing the objects and operations of this tensor calculus formalism in a strongly 
typed language such as Haskell. This would be a step toward automated calculation checking, and 
could be considered a step toward automated proof checking from the top down (as opposed to from 
the bottom up, using a system such as the Coq Proof Assistant). 

• Is there some sort of completeness result about the calculational tools and type system in this paper? 
In other words, is it possible to accomplish "everything" in a global, coordinate-free way using a certain 
set of tools, such as puUback bundles, covariant derivatives, chain rules, permutations, evaluation-by- 
puUback? 

• The alternate form of the second variation (see (13.2)) can be used to form a generalized Jacobi field 
equation for a particular energy functional. Analysis of this equation and its solutions may give insights 
analogous to the standard (geodesic-based) Jacobi field equation. 
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